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PREFACE 



The theory of equations is not only a necessity in the subsequent mathe- 
matical courses and their applications, but furnishes an illuminating sequel to 
geometry, algebra and analytic geometry Moreover, it develops anew and in 
greater detail various fundamental ideas of calculus for the simple, but impor- 
tant, case of polynomials. The theory of equations therefore affords a useful 
supplement to differential calculus whether taken subsequently or simultane- 
ously 

It was to meet the numerous needs of the student in regard to his earlier and 
future mathematical courses that the present book was planned with great care 
and after wide consultation. It differs essentially from the author's Elementary 
Theory of Equations, both in regard to omissions and additions, and since it 
is addressed to younger students and may be used parallel with a course in 
differential calculus. Simpler and more detailed proofs are now employed. 
The exercises are simpler, more numerous, of greater variety, and involve more 
practical applications. 

This book throws important light on various elementary topics. For ex- 
ample, an alert student of geometry who has learned how to bisect any angle 
is apt to ask if every angle can be trisected with ruler and compasses and if 
not, why not. After learning how to construct regular polygons of 3, 4, 5, 6, 
8 and 10 sides, he will be inquisitive about the missing ones of 7 and 9 sides. 
The teacher will be in a comfortable position if he knows the facts and what 
is involved in the simplest discussion to date of these questions, as given in 
Chapter III. Other chapters throw needed light on various topics of algebra. In 
particular, the theory of graphs is presented in Chapter V in a more scientific 
and practical manner than was possible in algebra and analytic geometry. 

There is developed a method of computing a real root of an equation with 
minimum labor and with certainty as to the accuracy of all the decimals ob- 
tained. We first find by Horner's method successive transformed equations 
whose number is half of the desired number of significant figures of the root. 
The final equation is reduced to a linear equation by applying to the con- 
stant term the correction computed from the omitted terms of the second and 
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higher degrees, and the work is completed by abridged division. The method 
combines speed with control of accuracy. 

Newton's method, which is presented from both the graphical and the 
numerical standpoints, has the advantage of being applicable also to equations 
which are not algebraic; it is applied in detail to various such equations. 

In order to locate or isolate the real roots of an equation we may employ a 
graph, provided it be constructed scientifically, or the theorems of Descartes, 
Sturm, and Budan, which are usually neither stated, nor proved, correctly. 

The long chapter on determinants is independent of the earlier chapters. 
The theory of a general system of linear equations is here presented also from 
the standpoint of matrices. 

For valuable suggestions made after reading the preliminary manuscript of 
this book, the author is greatly indebted to Professor Bussey of the University 
of Minnesota, Professor Roever of Washington University, Professor Kempner 
of the University of Illinois, and Professor Young of the University of Chicago. 
The revised manuscript was much improved after it was read critically by 
Professor Curtiss of Northwestern University. The author's thanks are due 
also to Professor Dresden of the University of Wisconsin for various useful 
suggestions on the proof-sheets. 

Chicago, 1921. 
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CHAPTER I 

Complex Numbers 

1. Square Roots. If p is a positive real number, the symbol ^/p is used to 
denote the positive square root of p. It is most easily computed by logarithms. 

We shall express the square roots of negative numbers in terms of the 
symbol i such that the relation i 2 = —1 holds. Consequently we denote the 
roots of x 2 = —1 by i and — i. The roots of x 2 = —4 are written in the form 
±2i in preference to ±\/— 4. In general, if p is positive, the roots of x 2 = —p 
are written in the form ±^/pi in preference to ± A / = p. 

The square of either root is thus (y/p) 2 i 2 = —p. Had we used the less desirable 
notation zc-^—p for the roots of x 2 = —p, we might be tempted to find the square of 
either root by multiplying together the values under the radical sign and conclude 
erroneously that 

\/—p V^-p = vP 2 = +P- 

To prevent such errors we use ^fpi and not \f—p. 

2. Complex Numbers. If a and b are any two real numbers and i 2 = —1, 
a + bi is called a complex number 1 and a — bi its conjugate. Either is said to 
be zero if a = b = 0. Two complex numbers a + bi and c + di are said to be 
equal if and only if a = c and b = d. In particular, a + bi = if and only if 
a = b = 0. If b / 0, a + bi is said to be imaginary. In particular, bi is called a 
pure imaginary. 



1 Complex numbers are essentially couples of real numbers. For a treatment from this 
standpoint and a treatment based upon vectors, see the author's Elementary Theory of 
Equations, p. 21, p. 18. 
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Addition of complex numbers is denned by 

(a + bi) + (c + di) = (a + c) + (6 + d)i 

The inverse operation to addition is called subtraction, and consists in finding 
a complex number z such that 

(c + di) + 2 = a + bi. 

In notation and value, z is 

(a + bi) - (c + di) = (a - c) + (6 - d)i. 

Multiplication is defined by 

(a + bi)(c + di) = ac — bd+ (ad + bc)i, 

and hence is performed as in formal algebra with a subsequent reduction by 
means of i 2 = — 1. For example, 

(a + W)(o - bi) = a 2 - 6 2 i 2 = a 2 + 6 2 . 

Division is defined as the operation which is inverse to multiplication, and 
consists in finding a complex number q such that (a + bi)q = e + fi. Multiplying 
each member by a — bi, we find that q is, in notation and value, 

e + fi (e + fi)(a — bi) ae + bf af — be . 
a + bi a 2 + b 2 a 2 + b 2 a 2 + b 2 

Since a 2 + b 2 = implies a = b = when a and b are real, we conclude that 
division except by zero is possible and unique. 

EXERCISES 

Express as complex numbers 

1. v^- 2. VI. 

3. (V25 + v /3 25)v /r 16. 4. -§. 

5.8 + 273. 6 3 + ^ 7. 3 + 5*. 8. ^. 

2 + 7— T 2 — 3i a — bi 

9. Prove that the sum of two conjugate complex numbers is real and that their 
difference is a pure imaginary. 
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10. Prove that the conjugate of the sum of two complex numbers is equal to the 
sum of their conjugates. Does the result hold true if each word sum is replaced by 
the word difference? 

11. Prove that the conjugate of the product (or quotient) of two complex numbers 
is equal to the product (or quotient) of their conjugates. 

12. Prove that, if the product of two complex numbers is zero, at least one of 
them is zero. 

13. Find two pairs of real numbers x, y for which 

(x + yi) 2 = -7 +24i. 

As in Ex. 13, express as complex numbers the square roots of 

14. -ll + 60z. 15. 5 — 12*. 16. Acd + (2c 2 - 2d 2 )i. 



3. Cube Roots of Unity. Any complex number x whose cube is equal 
to unity is called a cube root of unity. Since 

x 3 - 1 = (x - l)(x 2 + x+ 1), 

the roots of x 3, = 1 are 1 and the two numbers x for which 

x 2 + x + l = 0, (x + \) 2 = -§, x + \ = ±\\/Zi. 

Hence there are three cube roots of unity, viz., 

1, ui = — j + jVoi, u/ = — ^ — j V Si. 

In view of the origin of u, we have the important relations 

uj 2 +u)+ 1 = 0, uj S = 1. 

Since uu 1 = 1 and w 3 = 1, it follows that u/ = uj 2 , uj = uj' 2 . 

4. Geometrical Representation of Complex Numbers. Using rect- 
angular axes of coordinates, OX and OY, we represent the complex number 
a + bi by the point A having the coordinates a, b (Fig. 1). 
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The positive number r = Va 2 + b 2 giving 
the length of OA is called the modulus (or 
absolute value) of a+bi. The angle 9 = XOA, 
measured counter-clockwise from OX to OA, 
is called the amplitude (or argument) of a + 
bi. Thus cos 6* = a/r, s'mO = b/r, whence 



(1) 



a + bi = r(cos 9 + i sin 0) 




The second member is called the trigonometric form of a + 6i. 

For the amplitude we may select, instead of 9, any of the angles 6 ± 360°, 
(9 ±720°, etc. 

Two complex numbers are equal if and only if their moduli are equal and 
an amplitude of the one is equal to an amplitude of the other. 

For example, the cube roots of unity are 1 and 




LO 



+ hVli 



cos 120° + i sin 120° 



uj 



\\/l% 



cos 240° + ism 240°, 



and are represented by the points marked 1, u>, to 2 
at the vertices of an equilateral triangle inscribed 
in a circle of radius unity and center at the ori- 
gin O (Fig. 2). The indicated amplitudes of u 
^ IG - ^ and to 2 are 120° and 240° respectively, while the 

modulus of each is 1. 
The modulus of —3 is 3 and its amplitude is 180° or 180° plus or minus the 
product of 360° by any positive whole number. 



5. Product of Complex Numbers. By actual multiplication, 

[r(cos 9 + i sin 9)~\ [r (cos a + i sin a)] 
= rr [(cos 9 cos a — sin 9 sin a) + i(sin 9 cos a + cos 9 sin a)] 
= rr' [cos(# + a) + ism{9 + a)], by trigonometry. 

Hence the modulus of the product of two complex numbers is equal to the prod- 
uct of their moduli, while the amplitude of the product is equal to the sum of 
their amplitudes. 

For example, the square of u = cos 120° + i sin 120° has the modulus 1 and 
the amplitude 120° + 120° and hence is J 2 = cos 240° + i sin 240°. Again, the 
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product of lu and to 2 has the modulus 1 and the amplitude 120° + 240° and hence 
is cos 360° + i sin 360°, which reduces to 1. This agrees with the known fact that 
to 3 = 1. 

Taking r = r' = 1 in the above relation, we obtain the useful formula 

(2) (cos 9 + % sin 9) (cos a + i sin a) = cos(9 + a) + i sm(9 + a) . 

6. Quotient of Complex Numbers. Taking a = [3 — 9 in (2) and di- 
viding the members of the resulting equation by cos 9 + ism 9, we get 

cos (3 + isin (3 . . . 

, , = cos(/3 -9) + i sm(/3 - 0). 

cos 9 + i sin y 

Hence the amplitude of the quotient of R{cos (3+ism f3) by r (cos 9 +is'm 9) is equal 
to the difference (3 — 9 of their amplitudes, while the modulus of the quotient is 
equal to the quotient R/r of their moduli. 
The case (3 — gives the useful formula 

cos 6* — isin#. 



cos 9 + i sin 9 

7. De Moivre's Theorem. // n is any positive whole number, 
(3) (cos 9 + i sin 9) n = cos n9 + i sin nO. 

This relation is evidently true when n = 1, and when n = 2 it follows from 
formula (2) with a = 9. To proceed by mathematical induction, suppose that 
our relation has been established for the values 1,2, ... , m of n. We can then 
prove that it holds also for the next value m + 1 of n. For, by hypothesis, we 
have 

(cos 9 + i sin 9) rn = cos m9 + i sin m9. 

Multiply each member by cos# + isin#, and for the product on the right sub- 
stitute its value from (2) with a = m6. Thus 

(cos 9 + i sin 9) m = (cos 9 + i sin 9) (cos m9 + i sin mff) , 
= cos(# + m9) + i sin((9 + m9) , 

which proves (3) when n — m+1. Hence the induction is complete. 
Examples are furnished by the results at the end of §5: 

(cos 120° + i sin 120°) 2 = cos 240° + i sin 240°, 
(cos 120° + i sin 120°) 3 = cos 360° + i sin 360°. 
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8. Cube Roots. To find the cube roots of a complex number, we first 
express the number in its trigonometric form. For example, 

4\/2 + 4\/2i = 8(cos45° + i sin 45°). 

If it has a cube root which is a complex number, the latter is expressible in 
the trigonometric form 

(4) r(cos9 + ism 9). 

The cube of the latter, which is found by means of (3), must be equal to the 
proposed number, so that 

r 3 (cos36> + ism 3(9) = 8(cos45° + ism 45°). 

The moduli r 3 and 8 must be equal, so that the positive real number r is 
equal to 2. Furthermore, 3$ and 45° have equal cosines and equal sines, and 
hence differ by an integral multiple of 360°. Hence 3$ = 45° + k ■ 360°, or 
9 = 15° + k ■ 120°, where k is an integer. 2 Substituting this value of 9 and the 
value 2 of r in (4), we get the desired cube roots. The values 0, 1, 2 of k give 
the distinct results 

Rl = 2(cosl5° +isinl5°), 
R 2 = 2(cosl35°+isml35°), 
R 3 = 2(cos255°+ism255°). 

Each new integral value of k leads to a result which is equal to R\, R 2 
or i?3. In fact, from k = 3 we obtain R\, from k = 4 we obtain R 2 , from k = 5 
we obtain R%, from fc = 6we obtain Ri again, and so on periodically. 

EXERCISES 

1. Verify that i?2 = uRi, R$ = uj 2 R\. Verify that R\ is a cube root of 
8(cos45° + i sin 45°) by cubing R\ and applying De Moivre's theorem. Why are 
the new expressions for R2 and R% evidently also cube roots? 

2. Find the three cube roots of —27; those of —i; those of to. 

3. Find the two square roots of i; those of — i; those of ui. 

4. Prove that the numbers cos 9-\- i sin 9 and no others are represented by points 
on the circle of radius unity whose center is the origin. 



2 Here, as elsewhere when the contrary is not specified, zero and negative as well as 
positive whole numbers are included under the term "integer." 
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5. If a + bi and c + di are represented by the points A and C in Fig. 3, prove that 
their sum is represented by the fourth vertex S of the parallelogram two of whose 
sides are OA and OC. Hence show that the modulus of the sum of two complex 
numbers is equal to or less than the sum of their moduli, and is equal to or greater 
than the difference of their moduli. 





6. Let r and r' be the moduli and 9 and a the amplitudes of two complex 
numbers represented by the points A and C in Fig. 4. Let U be the point on the 
x-axis one unit to the right of the origin O. Construct triangle OCP similar to 
triangle OUA and similarly placed, so that corresponding sides are OC and OU, CP 
and UA, OP and OA, while the vertices O, C, P are in the same order (clockwise or 
counter-clockwise) as the corresponding vertices O, U, A. Prove that P represents 
the product (§5) of the complex numbers represented by A and C. 

7. If a + bi and e + fi are represented by the points A and S in Fig. 3, prove 
that the complex number obtained by subtracting a+bi from e + fi is represented 
by the point C. Hence show that the absolute value of the difference of two complex 
numbers is equal to or less than the sum of their absolute values, and is equal to or 
greater than the difference of their absolute values. 

8. By modifying Ex. 6, show how to construct geometrically the quotient of two 
complex numbers. 



9. nth Roots. As illustrated in §8, it is evident that the nth roots of 
any complex number p(cos^4 + ism A) are the products of the nth roots of 
cos A + i sin A by the positive real nth root of the positive real number p (which 
may be found by logarithms). 

Let an nth root of cos A + i sin A be of the form 

(4) r(cos9 + ism 9). 

Then, by De Moivre's theorem, 

r n (cos n9 + i sin n9) = cos A + i sin A. 
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and 1 must be equal, so that the positive real number r is 
equal to 1. Since n0 and A have equal sines and equal cosines, they differ by 
an integral multiple of 360°. Hence nO = A + k ■ 360°, where k is an integer. 
Substituting the resulting value of and the value 1 of r in (4), we get 

/rN (A + k-360°\ . . fA + k- 360° 
(5) cos I I + ism 



n 1 \ n 



For each integral value of k, (5) is an answer since its nth power reduces to 
cos A + ism A by DeMoivre's theorem. Next, the value n of k gives the same 
answer as the value of k; the value n + 1 of k gives the same answer as the 
value 1 of k; and in general the value n + m of k gives the same answer as 
the value m of k. Hence we may restrict attention to the values 0, 1, . . . , n — 1 
of k. Finally, the answers (5) given by these values 0, 1, . . . , n — 1 of k are all 
distinct, since they are represented by points whose distance from the origin 
is the modulus 1 and whose amplitudes are 

A A 360° A 2 • 360° A (n - 1)360° 



n n n n n n n 

so that these n points are equally spaced points on a circle of radius unity. 
Special cases are noted at the end of §10. Hence any complex number different 
from zero has exactly n distinct complex nth roots. 

10. Roots of Unity. The trigonometric form of 1 is cos 0°+i sin 0°. Hence 

by §9 with A = 0, the n distinct nth roots of unity are 

, > 2kn 2kn 

(6) cos hisin (k = 0, 1, . . . ,n — 1), 

n n 

where now the angles are measured in radians (an angle of 180 degrees being 
equal to it radians, where tv = 3.1416, approximately). For k = 0, (6) reduces 
to 1, which is an evident nth root of unity. For k = 1, (6) is 

,_ 2tt 2tt 

(7) it = cos Msin — . 

n n 

By De Moivre's theorem, the general number (6) is equal to the fcth power 
of R. Hence the n distinct nth roots of unity are 

(o\ p p2 r>3 n n ~ 1 T) n 1 

\o) It, ti , ri , . . . , ri ,-K — I- 

As a special case of the final remark in §9, the n complex numbers (6), and 
therefore the numbers (8), are represented geometrically by the vertices of a 
regular polygon of n sides inscribed in the circle of radius unity and center at 
the origin with one vertex on the positive x-axis. 



ill. 



PRIMITIVE ROOTS OF UNITY 




Fig. 5 



For n = 3, the numbers (8) are u, uj 2 , 1, which are 
represented in Fig. 2 by the vertices of an equilateral tri- 
angle. 

For n = 4, R = cos7r/2 + isin7r/2 = i. The four fourth 
roots of unity (8) are i, i 2 = — 1, i 3 = —i, i — 1, which 
are represented by the vertices of a square inscribed in a 
circle of radius unity and center at the origin O (Fig. 5). 

EXERCISES 



1. Simplify the trigonometric forms (6) of the four fourth roots of unity. Check 
the result by factoring x 4 — 1. 

2. For n — 6, show that R = —uj 2 . The sixth roots of unity are the three cube 
roots of unity and their negatives. Check by factoring x 6 — 1. 

3. From the point representing a + bi, how do you obtain that representing 
— (a + bi)? Hence derive from Fig. 2 and Ex. 2 the points representing the six sixth 
roots of unity. Obtain this result another way. 

4. Find the five fifth roots of —1. 

5. Obtain the trigonometric forms of the nine ninth roots of unity. Which of 
them are cube roots of unity? 

6. Which powers of a ninth root (7) of unity are cube roots of unity? 



11. Primitive nth Roots of Unity. An nth root of unity is called 
primitive if n is the smallest positive integral exponent of a power of it that 
is equal to unity. Thus p is a primitive nth root of unity if and only if p n = 1 
and p l / 1 for all positive integers I < n. 

Since only the last one of the numbers (8) is equal to unity, the number R, 
defined by (7), is a primitive nth root of unity. We have shown that the 
powers (8) of R give all of the nth roots of unity. Which of these powers of R 
are primitive nth roots of unity? 

For n = 4, the powers (8) of R = i were seen to be 



i, i 



-1, i J 



-i, i 



The first and third are primitive fourth roots of unity, and their exponents 1 and 3 
are relatively prime to 4, i.e., each has no divisor > 1 in common with 4. But the 
second and fourth are not primitive fourth roots of unity (since the square of —1 
and the first power of 1 are equal to unity), and their exponents 2 and 4 have the 
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divisor 2 in common with n = 4. These facts illustrate and prove the next theorem 
for the case n = 4. 

THEOREM. The primitive nth roots of unity are those of the numbers (8) 
whose exponents are relatively prime to n. 

Proof. If k and n have a common divisor d (d > 1), R k is not a primitive 
nth root of unity, since 

(R k ) r i = (i2»)3 = 1, 

and the exponent n/d is a positive integer less than n. 

But if k and n are relatively prime, i.e., have no common divisor > 1, R k 
is a primitive nth root of unity. To prove this, we must show that (R k ) 1 / 1 if 
I is a positive integer < n. By De Moivre's theorem, 

m 2kln 2kln 

R Kl = cos h % sin . 

n n 

If this were equal to unity, 2klir/n would be a multiple of 2n, and hence kl a 
multiple of n. Since k is relatively prime to n, the second factor I would be a 
multiple of n, whereas < I < n. 

EXERCISES 

1. Show that the primitive cube roots of unity are u> and u> 2 . 

2. For R given by (7), prove that the primitive nth roots of unity are (i) for 
n = 6, R, R 5 ; (ii) for n = 8, R, R 3 , R 5 , R 7 ; (hi) for n = 12, R, R 5 , R 7 , R 11 . 

3. When n is a prime, prove that any nth root of unity, other than 1, is primitive. 

4. Let R be a primitive nth root (7) of unity, where n is a product of two 
different primes p and q. Show that R, . . . , R n are primitive with the exception 
of R p , R 2p , . . . , R qp , whose qth. powers are unity, and R q , R 2q , . . . , R pq , whose pth 
powers are unity. These two sets of exceptions have only R pq in common. Hence 
there are exactly pq — p — q + 1 primitive nth roots of unity. 

5. Find the number of primitive nth roots of unity if n is a square of a prime p. 

6. Extend Ex. 4 to the case in which n is a product of three distinct primes. 

7. If R is a primitive 15th root (7) of unity, verify that R 3 , R 6 , R 9 , R 12 are the 
primitive hfth roots of unity, and R 5 and R 10 are the primitive cube roots of unity. 
Show that their eight products by pairs give all the primitive 15th roots of unity. 

8. If p is any primitive nth root of unity, prove that p, p 2 , . . . , p n are distinct 
and give all the nth roots of unity. Of these show that p k is a primitive nth root of 
unity if and only if k is relatively prime to n. 



311.1 PRIMITIVE ROOTS OF UNITY 11 



9. Show that the six primitive 18th roots of unity are the negatives of the 
primitive ninth roots of unity. 



CHAPTER II 

Elementary Theorems on the Roots of an Equation 

12. Quadratic Equation. If a, b, c are given numbers, a/0, 

(1) ax 2 + bx + c = (a / 0) 

is called a quadratic equation or equation of the second degree. The reader 
is familiar with the following method of solution by "completing the square." 
Multiply the terms of the equation by 4a, and transpose the constant term; 
then 

9 9 

4a x + Aabx = —Aac. 
Adding b 2 to complete the square, we get 

(2ax + b) 2 = A, A = b 2 - 4ac, 



, . -6 + VA -b- VA 

2) xi = xo = 

w L 2a 2a 

By addition and multiplication, we find that 

(3) x\ + x 2 = — , xix 2 = -• 

a a 

Hence for all values of the variable x, 

(4) a(x — x\){x — X2) = ax — a(x\ + x^)x + ax\x<i = ax + bx + c, 

the sign = being used instead of = since these functions of x are identically 
equal, i.e., the coefficients of like powers of x are the same. We speak of 
a(x — x\)(x — X2) as the factored form of the quadratic function ax 2 + bx + c, 
and of x — x\ and x — x^ as its linear factors. 

In (4) we assign to x the values xi and x 2 in turn, and see that 

9 9 

= axi + fori + c, = ax 2 + 6x2 + c. 
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Hence the values (2) are actually the roots of equation (1). 

We call A = b 2 — Aac the discriminant of the function ax 2 + bx + c or of 
the corresponding equation (1). If A = 0, the roots (2) are evidently equal, 
so that, by (4), ax 2 + bx + c is the square of ^/a(x — aq), and conversely. We 
thus obtain the useful result that ax 2 + bx + c is a perfect square (of a linear 
function of x) if and only if b 2 = 4ac (i.e., if its discriminant is zero). 

Consider a real quadratic equation, i.e., one whose coefficients a, b, c are 
all real numbers. Then if A is positive, the two roots (2) are real. But if A is 
negative, the roots are conjugate imaginaries (§2). 

When the coefficients of a quadratic equation (1) are any complex numbers, 
A has two complex square roots (§9), so that the roots (2) of (1) are complex 
numbers, which need not be conjugate. 

For example, the discriminant of x 2 — 2x + c is A = 4(1 — c). If c = 1, then A = 
and x 2 — 2x + 1 = (x — l) 2 is a perfect square, and the roots 1, 1 of x 2 — 2x + 1 = 
are equal. If c = 0, A = 4 is positive and the roots and 2 of x 2 — 2x = x(x — 2) = 
are real. If c = 2, A = —4 is negative and the roots 1 ± \f—\ of x 2 — 2x + 2 = are 
conjugate complex numbers. The roots oi x 2 — x + l + i = are % and 1—i, and are 
not conjugate. 

13. Integral Rational Function, Polynomial. If n is a positive integer 
and co, c\, . . . , c n are constants (real or imaginary), 

f(x) = cqx u + c\x n ~ X H h c n _ix + c n 

is called a polynomial in x of degree n, or also an integral rational function of x 
of degree n. It is given the abbreviated notation f(x), just as the logarithm of 
x + 2 is written log(x + 2). 

If co / 0, f(x) = is an equation of degree n. If n = 3, it is often called a 
cubic equation; and, if n = 4, a quartic equation. For brevity, we often speak 
of an equation all of whose coefficients are real as a real equation. 

14. The Remainder Theorem. If a polynomial f(x) be divided by x — c 
until a remainder independent of x is obtained, this remainder is equal to f(c), 
which is the value of f(x) when x = c. 

Denote the remainder by r and the quotient by q(x). Since the dividend 
is f(x) and the divisor is x — c, we have 

f(x) = (x — c)q(x) + r, 

identically in x. Taking x = c, we obtain f(c) = r. 

If r = 0, the division is exact. Hence we have proved also the following 
useful theorem. 
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The Factor Theorem. If f( c ) ^ s zero > the polynomial f(x) has the 
factor x - c. In other words, if c is a root of f(x) = 0, x — c is a factor of f(x) . 

For example, 2 is a root of x 3 — 8 = 0, so that x — 2 is a factor of x 3 — 8. 
Another illustration is furnished by formula (4). 

EXERCISES 

Without actual division find the remainder when 
1.x 4 — 3x 2 — x — 6 is divided by x + 3. 

2. x 3 — 3x 2 + 6x — 5 is divided by x — 3. 
Without actual division show that 

3. 18x 10 + 19x 5 + 1 is divisible by x + 1. 

4. 2x 4 — x 3 — 6x 2 + 4x — 8 is divisible by x — 2 and x + 2. 

5. x 4 — 3x 3 + 3x 2 — 3x + 2 is divisible by x — 1 and x — 2. 

6. r 3 — 1, r 4 — 1, r 5 — 1 are divisible by r — 1. 

7. By performing the indicated multiplication, verify that 

r n - 1 = (r - l)(r n_1 + r n ~ 2 + • • • + r + 1). 

8. In the last identity replace r by x/y, multiply by y n , and derive 

x n - y n = (x - y){x n ~ l + x n ~ 2 y + ■■■ + xy n ~ 2 + y n ~ l ). 

9. In the identity of Exercise 8 replace y by —y, and derive 

x n + y n = {x + y){x n ~ l - x n ~ 2 y + xy n " 2 + y n ~ l ), n odd; 

x n -y n = (x + y){x n ~ 1 - x n ~ 2 y -\ \- xy n ~ 2 - y n_1 ), n even. 

Verify by the Factor Theorem that x + y is a factor. 

10. If a, ar, ar 2 , . . . , ar n ~ l are n numbers in geometrical progression (the ratio 
of any term to the preceding being a constant r^l), prove by Exercise 7 that their 
sum is equal to 

a{r n - 1) 

r — 1 
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11. At the end of each of n years a man deposits in a savings bank a dollars. 
With annual compound interest at 4%, show that his account at the end of n years 
will be 

^{(1-04)" -1} 

dollars. Hint: The hnal deposit draws no interest; the prior deposit will amount 
to a(1.04) dollars; the deposit preceding that will amount to a(1.04) 2 dollars, etc. 
Hence apply Exercise 10 for r = 1.04. 



15. Synthetic Division. The labor of computing the value of a polyno- 
mial in x for an assigned value of x may be shortened by a simple device. To 
find the value of 

x A + 3x 3 - 2x - 5 

for x = 2, note that x 4 = x ■ x 3 = 2x 3 , so that the sum of the first two terms 
of the polynomial is 5x 3 . To 5a: 3 = 5 • 2 2 x we add the next term — 2x and 
obtain 18x or 36. Combining 36 with the final term —5, we obtain the desired 
value 31. 

This computation may be arranged systematically as follows. After sup- 
plying zero coefficients of missing powers of x, we write the coefficients in a 
line, ignoring the powers of x. 

1 3 0-2-5 

2 10 20 36 



1 5 10 18 31 

First we bring down the first coefficient 1. Then we multiply it by the given 
value 2 and enter the product 2 directly under the second coefficient 3, add 
and write the sum 5 below. Similarly, we enter the product of 5 by 2 under the 
third coefficient 0, add and write the sum 10 below; etc. The final number 31 
in the third line is the value of the polynomial when x = 2. The remaining 
numbers in this third line are the coefficients, in their proper order, of the 
quotient 

x 3 + 5x 2 + 10cc + 18, 

which would be obtained by the ordinary long division of the given polynomial 
by x - 2. 

We shall now prove that this process, called synthetic division, enables us 
to find the quotient and remainder when any polynomial f(x) is divided by 
x — c. Write 

f(x) = a x n + aix n_1 H \- a n , 
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and let the constant remainder be r and the quotient be 

q(x) = b Q x n ~ l + bix n ~ 2 + ■■■ + bn-i. 
By comparing the coefficients of f(x) with those in 

(x - c)q(x) + r = b x n + (61 - c&o)^ -1 

+ (6 2 - cb{)x n ~ 2 H h (6 n _i - c6 ri _ 2 )x + r - cb n _i, 

we obtain relations which become, after transposition of terms, 

bo = a , bi = ai + cb , b 2 = a 2 + c&i, . . . , fe,j_i = a n _i + cfc,j_2, r = a n + cb n _ x . 

The steps in the work of computing the fe's may be tabulated as follows: 

«0 «l fl 2 • • • a n -i a n \_c_ 

cb cb\ ■■■ cb n _2 cb n -i 



b bi b 2 ■■■ b n _i, r 

In the second space below uq we write bo (which is equal to 00) ■ We multiply 
60 by c and enter the product directly under a\, add and write the sum b\ 
below it. Next we multiply b\ by c and enter the product directly under 02, 
add and write the sum 62 below it; etc. 

EXERCISES 

Work each of the following exercises by synthetic division. 

1. Divide x 3 + 3x 2 — 2x — 5 by x — 2. 

2. Divide 2x 5 - x 3 + 2x - 1 by x + 2. 

3. Divide x 3 + 6x 2 + lOx - 1 by x - 0.09. 

4. Find the quotient of x 3 — 5x 2 — 2x + 24 by x — 4, and then divide the quotient 
by x — 3. What are the roots of x 3 — 5x 2 — 2x + 24 = 0? 

5. Given that x 4 — 2x 3 — 7x 2 + 8x + 12 = has the roots —1 and 2, find the 
quadratic equation whose roots are the remaining two roots of the given equation, 
and find these roots. 

6. If x 4 — 2x 3 — 12x 2 + lOx + 3 = has the roots 1 and —3, find the remaining 
two roots. 

7. Find the quotient of 2x 4 — x 3 — 6x 2 + 4x — 8 by x 2 — 4. 

8. Find the quotient of x 4 — 3x 3 + 3x 2 — 3x + 2 by x 2 — 3x + 2. 

9. Solve Exercises 1, 2, 3, 6, 7 of §14 by synthetic division. 
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16. Factored Form of a Polynomial. Consider a polynomial 

f(x) = c x n + c 1 x n - 1 + --- + c n (c /0), 

whose leading coefficient cq is not zero. If f(x) = has the root a\, which may 
be any complex number, the Factor Theorem shows that f(x) has the factor 
x — «i, so that 

f(x) = (x- ai)Q(x), Q(x) = c x n ~ l + c x x n ~ 2 + ■■■ + c a _ 1 . 

If Q(x) = has the root a 2 , then 

Q(x) = (x - a 2 )Qi(x), f(x) = (x- ai)(x - a 2 )Qi(x). 

If Qi(x) = has the root 03, etc., we finally get 

(5) f(x) = cq(x - a\)(x - a 2 ) ■ ■ ■ (x - a n ). 

We shall deduce several important conclusions from the preceding discus- 
sion. First, suppose that the equation f(x) = of degree n is known to 
have n distinct roots a\,...,a n . In f(x) = (x — a\)Q(x) take x = a 2 ; then 
= (a 2 — ai)Q(a 2 ), whence Q(a 2 ) = and Q(x) = has the root a 2 . Similarly, 
Ql(x) = has the root 03, etc. Thus all of the assumptions (each introduced 
by an "if") made in the above discussion have been justified and we have the 
conclusion (5). Hence if an equation f(x) = of degree n has n distinct roots 
«!,..., a n , f(x) can be expressed in the factored form (5). 

It follows readily that the equation can not have a root a different from 
a\, . . . , a n . For, if it did, the left member of (5) is zero when x = a and hence 
one of the factors of the right member must then be zero, say a—aj = 0, whence 
the root a is equal to ay. We have now proved the following important result. 

THEOREM. An equation of degree n cannot have more than n distinct 
roots. 

17. Multiple Roots. 1 Equalities may occur among the a's in (5). Sup- 
pose that exactly mi of the a's (including ai) are equal to ay, that a 2 ^ a\, 
while exactly m 2 of the a's are equal to a 2 ; etc. Then (5) becomes 



(6) f(x) = c (x-a 1 ) ni i(x-a 2 ) m i---(x-a k ) m K, m 1 +m 2 + --- + m k 



n. 



where ai,...,a k are distinct. We then call a\ a root of multiplicity mi of 
f(x) = 0, a 2 a root of multiplicity m 2 , etc. In other words, a\ is a root of 



1 Multiple roots are treated by calculus in §58. 
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multiplicity m\ of f(x) = if f(x) is exactly divisible by (x — ai) mi , but is not 
divisible by (x — ai) mi+1 . We call a\ also an mi-fold root. In the particular 
cases mi = 1, 2, and 3, we also speak of cq as a simple root, double root, and 
triple root, respectively. For example, 4 is a simple root, 3 a double root, —2 a 
triple root, and 6 a root of multiplicity 4 (or a 4-fold root) of the equation 

7(x - 4)(x - 3) 2 (x + 2) 3 (x - 6) 4 = 

of degree 10 which has no further root. This example illustrates the next 
theorem, which follows from (6) exactly as the theorem in §16 followed from (5). 

THEOREM. An equation of degree n cannot have more than n roots, a 
root of multiplicity m being counted as m roots. 

18. Identical Polynomials. // two polynomials in x, 

a x n + a x x n ~ X + ■ ■ ■ + an, b x n + bix n ~ X + ■ ■ ■ + b n , 

each of degree n, are equal in value for more than n distinct values of x, they 
are term by term identical, i.e., uq = bo, a\ = b\, . . . ,a n = b n . 

For, taking their difference and writing cq = ao - bo, . . . , c n = a n — b n , we 
have 

c Q x n + c 1 x n ~ l + --- + c n = 

for more than n distinct values of x. If cq / 0, we would have a contradiction 
with the theorem in §16. Hence cq = 0. If c\ / 0, we would have a contradiction 
with the same theorem with n replaced by n — 1. Hence c\ = 0, etc. Thus 
ao = bo, a\ = b\, etc. 

EXERCISES 

1. Find a cubic equation having the roots 0, 1, 2. 

2. Find a quartic equation having the roots ±1, ±2. 

3. Find a quartic equation having the two double roots 3 and —3. 

4. Find a quartic equation having the root 2 and the triple root 1. 

5. What is the condition that ax 2 + bx + c = shall have a double root? 

6. If aQX n + • • • + a n = has more than n distinct roots, each coefficient is zero. 

7. Why is there a single answer to each of Exercises 1-4, if the coefficient of 
the highest power of the unknown be taken equal to unity? State and answer the 
corresponding general question. 
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19. The Fundamental Theorem of Algebra. Every algebraic equa- 
tion with complex coefficients has a complex (real or imaginary) root. 

This theorem, which is proved in the Appendix, implies that every equation 
of degree n has exactly n roots if a root of multiplicity m be counted as m roots. 
In other words, every integral rational function of degree n is a product of 
n linear factors. For, in §16, equations f(x) = 0, Q(x) = 0, Q\{x) = 0, . . . each 
has a root, so that (5) and (6) hold. 

20. Relations between the Roots and the Coefficients. In §12 we 

found the sum and the product of the two roots of any quadratic equation and 
then deduced the factored form of the equation. We now apply the reverse 
process to any equation 

(7) f(x) = c x n + c x x n - X + • • • + c n = (c /0), 
whose factored form is 

(8) f(x) = c Q (x - ai)(x - o 2 ) • • • (x - a n ). 

Our next step is to find the expanded form of this product. The following 
special products may be found by actual multiplication: 

(x — a\)(x — 02) = x — (a\ + a^)x + 0102, 
(x — ol\)[x — a<2){x — 03) = x — (ai + £*2 + a z) x 

+ («ia2 + 010:3 + a.2a3)x — axa^otj,. 
These identities are the cases n = 2 and n = 3 of the following general formula: 

(9) (x - a\)(x - 02) • • • (x - a n ) = x n - (a\ -\ h a n )x n ~ 

+ (ai«2 + "103 + «2 a 3 + • • • + ct n _ia n )x n ~ 

- (01020:3 + 010204 + • • • + a n -2a n -ia n )x n ~ 

+ h (-l) ri oi02- ••(Xn, 

the quantities in parentheses being described in the theorem below. If we 
multiply each member of (9) by x — o n +l> it is n ot much trouble to verify that 
the resulting identity can be derived from (9) by changing n into n + 1, so that 

(9) is proved true by mathematical induction. Hence the quotient of (7) by cq 
is term by term identical with (9), so that 

Oi + 2 + h«n = -Cl/CQ, 

OlQ2 + OlQ3 + 0203 H h « n -l«Ti = C2/C0, 

(10) 010203 + 010204 + h 0^-2071-1071 = -C3/C0, 

0102 • • • a n -\a n = (-l) n c n /cQ. 
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These results may be expressed in the following words: 

THEOREM. Ifa\, . . . , a n are the roots of equation (7), the sum of the roots 
is equal to —c\/cq, the sum of the products of the roots taken two at a time 
is equal to 02/00, the sum of the products of the roots taken three at a time is 
equal to —03/00, etc.; finally, the product of all the roots is equal to (—l) n c n /co- 

Since we may divide the terms of our equation (7) by cq, the essential part 
of our theorem is contained in the following simpler statement: 

COROLLARY. In an equation in x of degree n, in which the coefficient 
of x n is unity, the sum of the n roots is equal to the negative of the coefficient 
of x n_1 , the sum of the products of the roots two at a time is equal to the 
coefficient of x n ~ 2 , etc.; finally the product of all the roots is equal to the 
constant term or its negative, according as n is even or odd. 

For example, in a cubic equation having the roots 2, 2, 5, and having unity as 
the coefficient of x 3 , the coefficient ofxis2-2 + 2-5 + 2-5=24. 

EXERCISES 

1. Find a cubic equation having the roots 1, 2, 3. 

2. Find a quartic equation having the double roots 2 and —2. 

3. Solve x 4 — 6x 3 + 13x 2 — 12x + 4 = 0, which has two double roots. 

4. Prove that one root of x 3 + px 2 + qx + r = is the negative of another root 
if and only if r = pq. 

5. Solve 4x 3 — 16x 2 — 9x + 36 = 0, given that one root is the negative of another. 

6. Solve x 3 — 9x 2 + 23x — 15 = 0, given that one root is the triple of another. 

7. Solve x 4 — 6x 3 + 12x 2 — lOx + 3 = 0, which has a triple root. 

8. Solve x 3 — 14x 2 — 84x + 216 = 0, whose roots are in geometrical progression, 
i.e., with a common ratio r [say m/r, m, mr\. 

9. Solve x 3 — 3x 2 — 13x + 15 = 0, whose roots are in arithmetical progression, 
i.e., with a common difference d [say m — d, m, m + d\. 

10. Solve x 4 — 2x 3 — 21x 2 + 22x + 40 = 0, whose roots are in arithmetical pro- 
gression. [Denote them by c — 36, c — b, + 6, c + 36, with the common difference 
26]. 

11. Find a quadratic equation whose roots are the squares of the roots of x 2 — 
px + q = 0. 
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12. Find a quadratic equation whose roots are the cubes of the roots of x 2 — px + 
q = 0. Hint: a 3 + (3 3 = (a + (3) 3 - 3a/3(a + 0). 

13. If a and (3 are the roots of x 2 — px + q = 0, find an equation whose roots are 
(i) a 2 //3; and /3 2 /a; (ii) a 3 /3 and a/3 3 ; (iii) a + 1//3 and /3 + 1/a. 

14. Find a necessary and sufficient condition that the roots, taken in some order, 
of x 3 + px 2 + qx + r = shall be in geometrical progression. 

15. Solve x 3 — 28x + 48 = 0, given that two roots differ by 2. 



21. Imaginary Roots occur in Pairs. The two roots of a real quadratic 
equation whose discriminant is negative are conjugate imaginaries (§12). This 
fact illustrates the following useful result. 

THEOREM. If an algebraic equation with real coefficients has the root a + 
hi, where a and b are real and b / 0, it has also the root a — bi. 
Let the equation be f(x) = and divide f(x) by 

(11) (x - a) 2 + b 2 = (x- a- bi)(x - a + bi) 

until we reach a remainder rx + s whose degree in x is less than the degree of 
the divisor. Since the coefficients of the dividend and divisor are all real, those 
of the quotient Q(x) and remainder are real. We have 

f(x) = Q(x){(x - a) 2 + fe 2 } + rx + s, 

identically in x. This identity is true in particular when x = a + bi, so that 

= r{a + bi) + s = ra + s + rbi. 

Since all of the letters, other than i, denote real numbers, we have (§2) ra + s = 
0, rb = 0. But b / 0. Hence r = 0, and then s = 0. Hence f(x) is exactly 
divisible by the function (11), so that f(x) = has the root a — bi. 

The theorem may be applied to the real quotient Q(x). We obtain the 

COROLLARY. If a real algebraic equation has an imaginary root of mul- 
tiplicity m, the conjugate imaginary of this root is a root of multiplicity m. 

Counting a root of multiplicity m as m roots, we see that a real equation 
cannot have an odd number of imaginary roots. Hence by §19, a real equation 
of odd degree has at least one real root. 

Of the n linear factors of a real integral rational function of degree n (§19), 
those having imaginary coefficients may be paired as in (11). Hence every 
integral rational function with real coefficients can be expressed as a product of 
real linear and real quadratic factors. 
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EXERCISES 

1. Solve x 3 — 3x 2 — 6x — 20 = 0, one root being — 1 + \/ — 3. 

2. Solve x 4 — 4x 3 + 5x 2 — 2x — 2 = 0, one root being 1 — i. 

3. Find a cubic equation with real coefficients two of whose roots are 1 and 
3 + 2i. 

4. If a real cubic equation x 3 — 6x 2 + • • • = has the root 1 + \/— 5, what are 
the remaining roots? Find the complete equation. 

5. If an equation with rational coefficients has a root a + \/b, where a and b are 
rational, but \/b is irrational, prove that it has the root a — \/b. [Use the method 
of §21.] 

6. Solve x 4 — 4x 3 + 4x — 1 = 0, one root being 2 + V3. 

7. Solve x 3 - (4 + \/3)x 2 + (5 + 4 v / 3)x - 5\/3 = 0, having the root \/3. 

8. Solve the equation in Ex. 7, given that it has the root 2 + i. 

9. Find a cubic equation with rational coefficients having the roots j, 2 + ^2. 

10. Given that x 4 — 2x 3 — 5x 2 — 6x + 2 = has the root 2 — V3, find another 
root and by means of the sum and the product of the four roots deduce, without 
division, the quadratic equation satisfied by the remaining two roots. 

11. Granted that a certain cubic equation has the root 2 and no real root different 
from 2, does it have two imaginary roots? 

12. Granted that a certain quartic equation has the roots 2±3i, and no imaginary 
roots different from them, does it have two real roots? 

13. By means of the proof of Ex. 5, may we conclude as at the end of §21 
that every integral rational function with rational coefficients can be expressed as a 
product of linear and quadratic factors with rational coefficients? 



22. Upper Limit to the Real Roots. Any number which exceeds all 
real roots of a real equation is called an upper limit to the real roots. We shall 
prove two theorems which enable us to find readily upper limits to the real 
roots. For some equations Theorem I gives a better (smaller) upper limit than 
Theorem II; for other equations, the reverse is true. Evidently any positive 
number is an upper limit to the real roots of an equation having no negative 
coefficients. 
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Theorem I. If, in a real equation 

f(x) = a x n + a x x n ~ x H \- a n = (a > 0), 

£/ie /irsi negative coefficient is preceded by k coefficients which are positive or 
zero, and if G denotes the greatest of the numerical values of the negative 
coefficients, then each real root is less than 1 + ^/gJoq. 

For example, in x 5 + 4x 4 — 7x 2 — 40x + 1 = 0, G = 40 and k = 3 since we must 
supply the coefficient zero to the missing power x 3 . Thus the theorem asserts that 
each root is less than 1 + v40 and therefore less than 4.42. Hence 4.42 is an upper 
limit to the roots. 

Proof. For positive values of x, f(x) will be reduced in value or remain 
unchanged if we omit the terms aix n_1 , . . . , a^_i x n ~ k+1 (which are positive or 
zero), and if we change each later coefficient aj~, . . . ,a n to —G. Hence 

f(x) ^ a x n - G(x n ~ k + x n - k ~ l + • • • + x + 1). 



But, by Ex. 7 of §14, 



x n fc H Vx + l = 



x n-k+l _ 1 
x — 1 



if x / 1. Furthermore, 

„n-fc+l _ A _ x n - k+1 {a Q x k - 1 (x -1)-G} + G 



aQX 
Hence, if x > 1 




x — 1 / x — 1 



x n - k+1 {a x k - 1 (x - 1) - G] 
f{x) > 



/(*)> 



x — 1 
x n - k+1 {a (x-l) k -G} 



x — 1 

Thus, for x > 1, f(x) > and x is not a root if ao(x - l) fc - G ^ 0, which is 
true if x ^ 1 + \fG~Ja~Q. 

23. Another Upper Limit to the Roots. 

THEOREM II. If, in a real algebraic equation in which the coefficient of the 
highest power of the unknown is positive, the numerical value of each negative 
coefficient be divided by the sum of all the positive coefficients which precede 
it, the greatest quotient so obtained increased by unity is an upper limit to the 
roots. 
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For the example in §22, the quotients are 7/(1 + 4) and 40/5, so that Theorem II 
asserts that 1 + 8 or 9 is an upper limit to the roots. Theorem I gave the better 
upper limit 4.42. But for a: 3 + 8x 2 — 9x + c 2 = 0, Theorem I gives the upper limit 4, 
while Theorem II gives the better upper limit 2. 

We first give the proof for the case of the equation 

f(x) = P4X 4 - P3X 3 + P2X 2 - V\X + po = 

in which each pi is positive. In view of the identities 

x 4 = (x - l)(x 3 + x 2 + x + 1) + 1, x 2 = (x - l)(x + 1) + 1, 

f(x) is equal to the sum of the terms 

Pa{x — l)x + p^{x — l)x + Pa{x — l)x + p^ix — 1) + P4, 

- P3X 3 +p 2 (x - l)x + p 2 (x - 1) + P2, 

~P\X +p - 

If x > 1, negative terms occur only in the first and third columns, while the sum of 
the terms in each of these two columns will be ^ if 

P4,(x - 1) - p 3 ^ 0, (p 4 + p 2 )(x - 1) - pi ^ 0. 

Hence f(x) > and x is not a root if 

x > 1 H , x > 1 + 



P4 P4 + P2 

This proves the theorem for the present equation. 

Next, let f(x) be modified by changing its constant term to —po- We modify the 
above proof by employing the sum (p4+P2)x—po of all the terms in the corresponding 
last two columns. This sum will be > if x > Po/(pa + P2), which is true if 

x>l+ P0 



P4+P2 



To extend this method of proof to the general case 

f(x) = a n x n + • • • + a (a n >0), 

we have only to employ suitable general notations. Let the negative coefficients 
be a^, . . . , ak t , where k\ > ki > ■ ■ ■ > kt- For each positive integer m which is 
^ n and distinct from ki,...,kt, we replace x m by the equal value 

d{x m - 1 + x m - 2 + ■ ■ ■ + X + 1) + 1 

where d = x — 1. Let F(x) denote the polynomial in x, with coefficients involv- 
ing d, which is obtained from f(x) by these replacements. Let x > 1, so that 
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d is positive. Thus the terms a^.x ki are the only negative quantities occurring 
in F(x). If hi > 0, the terms of F(x) which involve explicitly the power x ki are 
a^.x ki and the a m dx ki for the various positive coefficients a m which precede a^.. 
The sum of these terms will be ^ if a^. + dJ2 a m = 0, i.e., if 

x > 1 + ~° kl 



Y, a m' 



There is an additional case if kt = 0, i.e., if ag is negative. Then the terms 
of F(x) not involving x explicitly are ciq and the a m (d + 1) for the various 
positive coefficients a m . Their sum, ao + x^a mi will be > if 



x > 



which is true if 

x>I + ~ a ° 



XX 



EXERCISES 

Apply the methods of both §22 and §23 to find an upper limit to the roots of 

1. 4x 5 - 8x 4 + 22x 3 + 98x 2 - 73x + 5 = 0. 

2. x 4 - 5x 3 + 7x 2 - 8x + I = 0. 

3. x 7 + 3x 6 - 4x 5 + 5x 4 - 6x 3 - 7x 2 - 8 = 0. 

4. x 7 + 2x 5 + 4x 4 - 8x 2 - 32 = 0. 

5. A lower limit to the negative roots of f(x) = may be found by applying our 
theorems to f(—x) = 0, i.e., to the equation derived from /(x) = by replacing x 
by —x. Find a lower limit to the negative roots in Exs. 2, 3, 4. 

6. Prove that every real root of a real equation f(x) = is less than 1 + g/ao if 
ao > 0, where g denotes the greatest of the numerical values of a\, . . . , a n . Hint: if 
x > 0, 

a x n + aix ra_1 H ^ a x n - g{x n ~ l -\ h x + 1). 

Proceed as in §22 with k = 1. 

7. Prove that 1 + g -j- |«o| is an upper limit for the moduli of all complex roots 
of any equation f(x) = with complex coefficients, where g is the greatest of the 
values |ai|, . . . , \a n \, and \a\ denotes the modulus of a. Hint: use Ex. 5 of §8. 
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24. Integral Roots. For an equation all of whose coefficients are inte- 
gers, any integral root is an exact divisor of the constant term. 
For, if x is an integer such that 

(12) aQX n + h a n _ix + a n = 0, 

where the a's are all integers, then, by transposing terms, we obtain 

x(-a x ri_1 a n _i) = a n . 

Thus x is an exact divisor of a n since the quotient is the integer given by the 
quantity in parenthesis. 

Example 1. Find all the integral roots of 

x 3 + x 2 - 3x + 9 = 0. 



Solution. The exact divisors of the constant term 9 are ±1, ±3, ±9. By trial, 
no one of ±1, 3 is a root. Next, we find that —3 is a root by synthetic division (§15): 

11-39 | -3 
-3 6 -9 



1-230 



Hence the quotient is x 2 — 2x + 3, which is zero for x = 1 ± \/— 2. Thus —3 is 
the only integral root. 

When the constant term has numerous exact divisors, some device may 
simplify the application of the theorem. 

Example 2. 2 Find all the integral roots of 

y 3 + 12y 2 - 32y - 256 = 0. 



Solution. Since all the terms except y 3 are divisible by 2, an integral root y 
must be divisible by 2. Since all the terms except y 3 are now divisible by 2 4 , we 
have y = 4z, where z is an integer. Removing the factor 2 6 from the equation in z, 
we obtain 

z 3 + 3z 2 - 2z - 4 = 0. 

An integral root must divide the constant term 4. Hence, if there are any integral 

roots, they occur among the numbers ±1, ±2, ±4. By trial, —1 is found to be a 

root: 

1 3-2-4 1 -1 

-1 -2 4 



12-40 



2 This problem is needed for the solution (§48) of a certain quartic equation. 
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Hence the quotient is z 2 + 2z — 4, which is zero for z = — 1 ± y5. Thus y = 4z = —4 
is the only integral root of the proposed equation. 

EXERCISES 

Find all the integral roots of 

1. x 3 + 8x 2 + 13x + 6 = 0. 2. x 3 - 5x 2 - 2x + 24 = 0. 

3. x 3 - 10x 2 + 27x - 18 = 0. 4. x 4 + 4x 3 + 8x + 32 = 0. 

5. The equation in Ex. 4 of §23. 



25. Newton's Method for Integral Roots. In §24 we proved that an 
integral root x of equation (12) having integral coefficients must be an exact 
divisor of a n . Similarly, if we transpose all but the last two terms of (12), we 
see that a n -\x + a n must be divisible by x 2 , and hence a n -\ + a n /x divisible 
by x. By transposing all but the last three terms of (12), we see that their sum 
must be divisible by x 3 , and hence a n -2 + (a n _i + a n /x)/x divisible by x. We 
thus obtain a series of conditions of divisibility which an integral root must 
satisfy. The final sum ao + a\/x + • • • must not merely be divisible by x, but 
be actually zero, since it is the quotient of the function (12) by x n . 

In practice, we must test in turn the various divisors x of a n . If a chosen x 
is not a root, that fact will be disclosed by one of the conditions mentioned. 
Newton's method is quicker than synthetic division since it usually detects 
early and throws out wrong guesses as to a root, whereas in synthetic division 
the decision comes only at the final step. 

For example, the divisor —3 of the constant term of 

(13) /(x) = x 4 - 9x 3 + 24x 2 - 23x + 15 = 

is not a root since —23 + 15/(— 3) = —28 is not divisible by —3. To show that none 
of the tests fails for 3, so that 3 is a root, we may arrange the work systematically 
as follows: 



(14) 



1 -9 24 -23 15 

-1 6-6 5 

-3 18 -18 



3 

(divisor) 



First we divide the hnal coefficient 15 by 3, place the quotient 5 directly under the 
coefficient —23, and add. Next, we divide this sum —18 by 3, place the quotient —6 
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directly under the coefficient 24, and add. After two more such steps we obtain the 
sum zero, so that 3 is a root. 

It is instructive to obtain the preceding process by suitably modifying synthetic 
division. First, we replace x by 1/y in (13), multiply each term by y 4 , and obtain 

15y 4 - 23y 3 + 24y 2 - 9y + 1 = 0. 

We may test this for the root y = g, which corresponds to the root x = 3 of (13), 
by ordinary synthetic division: 



l 
3 
(multiplier) 



15 -23 24 -9 1 
5-6 6-1 

15 -18 18 -3 

The coefficients in the last two lines (after omitting 15) are the same as those of the 
last two lines in (14) read in reverse order. This should be the case since we have 
here multiplied the same numbers by „- that we divided by 3 in (14). The numbers 
in the present third line are the coefficients of the quotient (§15). Since we equate 
the quotient to zero for the applications, we may replace these coefficients by the 
numbers in the second line which are the products of the former numbers by g . The 
numbers in the second line of (14) are the negatives of the coefficients of the quotient 
of f(x) by x — 3. 

Example. Find all the integral roots of equation (13). 

Solution. For a negative value of x, each term is positive. Hence all the real 
roots are positive. By §23, 10 is an upper limit to the roots. By §24, any integral 
root is an exact divisor of the constant term 15. Hence the integral roots, if any, 
occur among the numbers 1, 3, 5. Since /(l) = 8, 1 is not a root. By (14), 3 is 
a root. Proceeding similarly with the quotient by x — 3, whose coefficients are the 
negatives of the numbers in the second line of (14), we find that 5 is a root. 

EXERCISES 

1. Solve Exs. 1-4 of §24 by Newton's method. 

2. Prove that, in extending the process (14) to the general equation (12), we 
may employ the final equations in §15 with r = and write 

c 
(divisor) 



"0 


a i 


a 2 


On-2 


O-n-1 


-bo 


-h 


-b 2 


-b n -2 


-b n -l 



-cb -ch . . . -cb n - 3 -cb n - 2 

Here the quotient, —b n -\, of a n by c is placed directly under a n _i, and added to it 
to yield the sum —cb n -2, etc. 
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26. Another Method for Integral Roots. An integral divisor d of the 

constant term is not a root if d — m is not a divisor of f(m), where m is any 
chosen integer. For, if d is a root of f(x) = 0, then 

f(x) = (x-d)Q(x), 

where Q(x) is a polynomial having integral coefficients (§15). Hence f(m) = 
(m — d)q, where q is the integer Q(m). 

In the example of §25, take d = 15, m — 1. Since /(l) = 8 is not divisible by 
15 — 1 = 14, 15 is not an integral root. 

Consider the more difficult example 

f(x) = x 3 - 20a; 2 + 164a; - 400 = 0, 

whose constant term has many divisors. There is evidently no negative root, while 
21 is an upper limit to the roots. The positive divisors less than 21 of 400 = 2 4 5 2 are 
d = 1, 2, 4, 8, 16, 5, 10, 20. First, take m = 1 and note that /(l) = -255 = -3-5-17. 
The corresponding values of d — 1 are 0, 1, 3, 7, 15, 4, 9, 19; of these, 7, 4, 9, 19 are 
not divisors of /(l), so that d = 8, 5, 10 and 20 are not roots. Next, take m = 2 
and note that /(2) = —144 is not divisible by 16 — 2 = 14. Hence 16 is not a root. 
Incidentally, d = 1 and d = 2 were excluded since /(d) ^ 0. There remains only 
d = 4, which is a root. 

In case there are numerous divisors within the limits to the roots, it is 
usually a waste of time to list all these divisors. For, if a divisor is found to 
be a root, it is preferable to employ henceforth the quotient, as was done in 
the example in §25. 



EXERCISES 

Find all the integral roots of 

1. x 4 - 2x 3 - 2\x 2 + 22a; + 40 = 0. 

2. y 3 - 9y 2 - 24y + 216 = 0. 

3. x 4 - 23a; 3 + 187a; 2 - 653a; + 936 = 0. 

4. x 5 + 47x 4 + 423a; 3 + 140a; 2 + 1213a; - 420 = 0. 

5. x 5 - 34x 3 + 29a; 2 + 212x - 300 = 0. 
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27. Rational Roots. If an equation with integral coefficients 

(15) c x n + c\x n ~ l H Vc n -ix + c n = 

has the rational root a/b, where a and b are integers without a common divisor 
> 1, then a is an exact divisor of c n , and b is an exact divisor of cq. 

Insert the value a/b of x and multiply all terms of the equation by b n . We 
obtain 

c a n + c 1 a n ~ 1 b -\ \- c n _xab n ~ X + c n b n = 0. 

Since a divides all the terms preceding the last term, it divides that term. But 
a has no divisor in common with b n ; hence a divides c n - Similarly, b divides 
all the terms after the first term and hence divides cq. 

Example. Find all the rational roots of 

2x 3 - 7x 2 + 10a; - 6 = 0. 



Solution. By the theorem, the denominator of any rational root x is a divisor 
of 2. Hence y = 2x is an integer. Multiplying the terms of our equation by 4, we 
obtain 

y 3 - 7y 2 + 20y - 24 = 0. 

There is evidently no negative root. By either of the tests in §§22, 23, an upper limit 
to the positive roots of our equation in x is 1 + 7/2, so that y < 9. Hence the only 
possible values of an integral root y are 1, 2, 3, 4, 6, 8. Since 1 and 2 are not roots, 
we try 3: 

1 -7 20 -24 [3_ 

-1 4 -8 

-3 12 

Hence 3 is a root and the remaining roots satisfy the equation y 1 — Ay + 8 = and 
are 2 ± 2i. Thus the only rational root of the proposed equation is x = 3/2. 
If co = 1, then b = ±1 and a/b is an integer. Hence we have the 

COROLLARY. Any rational root of an equation with integral coefficients, 
that of the highest power of the unknown being unity, is an integer. 
Given any equation with integral coefficients 

a y n + axtf 1 ' 1 + • • • + a n = 0, 

we multiply each term by aQ n ~ 1 , write agy = x, and obtain an equation (15) 
with integral coefficients, in which the coefficient co of x n is now unity. By the 
Corollary, each rational root x is an integer. Hence we need only find all the 
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integral roots x and divide them by ao to obtain all the rational roots y of the 
proposed equation. 

Frequently it is sufficient (and of course simpler) to set ky = x, where k is 
a suitably chosen integer less than ao- 

EXERCISES 

Find all of the rational roots of 

1. y A - f y 3 + ±§V - 40y + 9 = 0. 2. 6y 3 - f ly 2 + 6y - 1 = 0. 

3. f 08y 3 - 270y 2 - 42y + f = 0. [Use k = 6.] 

4. 32y 3 -Qy- 1 = 0. [Use the least k.\ 

5. 96y 3 - 16y 2 - 6y + 1 = 0. 6. 24y 3 - 2y 2 - 5y + 1 = 0. 

7. y 3 - \y 2 - 2y + 1 = 0. 8. y 3 - § y 2 + 3y - 2 = 0. 

9. Solve Exs. 2-6 by replacing y by 1/x. 

Find the equations whose roots are the products of 6 by the roots of 

10. y 2 - 2y - \ = 0. 11. y 3 - \y 2 - \y + \ = 0. 



CHAPTER III 

Constructions with Ruler and Compasses 

28. Impossible Constructions. We shall prove that it is not possible, 
by the methods of Euclidean geometry, to trisect all angles, or to construct 
a regular polygon of 7 or 9 sides. The proof, which is beyond the scope 
of elementary geometry, is based on principles of the theory of equations. 
Moreover, the discussion will show that a regular polygon of 17 sides can be 
constructed with ruler and compasses, a fact not suspected during the twenty 
centuries from Euclid to Gauss. 



29. Graphical Solution of a Quadratic Equation. 

constructible, and 



If a and b are 



(1) 



ax + b = 



has real coefficients and real roots, 
the roots can be constructed with 
ruler and compasses as follows. Draw 
a circle having as a diameter the 
line BQ joining the points B = (0, 1) 
and Q = (a, b) in Fig. 6. Then the 
abscissas ON and OM of the points 
of intersection of this circle with the 
x-axis are the roots of (1). 

For, the center of the circle is 
(a/2, (6+ l)/2); the square of BQ is 




a 2 + (b — l) 2 ; hence the equation of the circle is 

x - - I -!- I y 



Fig. 6 



a 2 + (6-1) 



This is found to reduce to (1) when y = 0, which proves the theorem. 

When the circle is tangent to the x-axis, so that M and iV coincide, the two 
roots are equal. When the circle does not cut the x-axis, or when Q coincides 
with B, the roots are imaginary. 
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Another construction follows from §30. 

EXERCISES 

Solve graphically: 

1. x 2 - 5x + 4 = 0. 2. x 2 + 5x + 4 = 0. 3. x 2 + 5x - 4 = 0. 

4. x 2 - 5x - 4 = 0. 5. x 2 - 4x + 4 = 0. 6. x 2 - 3x + 4 = 0. 



30. Analytic Criterion for Constructibility. The first step in our 
consideration of a problem proposed for construction consists in formulating 
the problem analytically. In some instances elementary algebra suffices for this 
formulation. For example, in the ancient problem of the duplication of a cube, 
we take as a unit of length a side of the given cube, and seek the length x of 
a side of another cube whose volume is double that of the given cube; hence 

(2) x 3 = 2. 

But usually it is convenient to employ analytic geometry as in §29; a point 
is determined by its coordinates x and y with reference to fixed rectangular 
axes; a straight line is determined by an equation of the first degree, a circle 
by one of the second degree, in the coordinates of the general point on it. 
Hence we are concerned with certain numbers, some being the coordinates of 
points, others being the coefficients of equations, and still others expressing 
lengths, areas or volumes. These numbers may be said to define analytically 
the various geometric elements involved. 

CRITERION. A proposed construction is possible by ruler and compasses 
if and only if the numbers which define analytically the desired geometric ele- 
ments can be derived from those defining the given elements by a finite number 
of rational operations and extractions of real square roots. 

In §29 we were given the numbers a and b, and constructed lines of lengths 



Ua±^a 2 -Ab). 



Proof. First, we grant the condition stated in the criterion and prove 
that the construction is possible with ruler and compasses. For, a rational 
function of given quantities is obtained from them by additions, subtractions, 
multiplications, and divisions. The construction of the sum or difference of 
two segments is obvious. The construction, by means of parallel lines, of a 



530. 
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segment whose length p is equal to the product a ■ b of the lengths of two given 
segments is shown in Fig. 7; that for the quotient q = a/b in Fig. 8. Finally, 
a segment of length s = \pn may be constructed, as in Fig. 9, by drawing a 
semicircle on a diameter composed of two segments of lengths 1 and n, and 
then drawing a perpendicular to the diameter at the point which separates the 
two segments. Or we may construct a root of x 1 — n = by §29. 






b 

Fig. 7 Fig. 8 Fig. 9 

Second, suppose that the proposed construction is possible with ruler and 
compasses. The straight lines and circles drawn in making the construction are 
located by means of points either initially given or obtained as the intersections 
of two straight lines, a straight line and a circle, or two circles. Since the axes 
of coordinates are at our choice, we may assume that the y-axis is not parallel 
to any of the straight lines employed in the construction. Then the equation 
of any one of our lines is 



(3) 



y = mx + b. 



Let y = m'x + b 1 be the equation of another of our lines which intersects (3). 
The coordinates of their point of intersection are 



b'-b 



m — m 



mb 1 — m'b 



m — m 



which are rational functions of the coefficients of the equations of the two lines. 
Suppose that a line (3) intersects the circle 



(x - c) 2 + (y-d) 



2 2 

- r , 



with the center (c, d) and radius r. To find the coordinates of the points of 
intersection, we eliminate y between the equations and obtain a quadratic 
equation for x. Thus x (and hence also mx + b or y) involves no irrationality 
other than a real square root, besides real irrationalities present in m, b, c, 
d, r. 
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Finally, the intersections of two circles are given by the intersections of one 
of them with their common chord, so that this case reduces to the preceding. 

For example, a side of a regular pentagon inscribed in a circle of radius unity is 
(Ex. 2 of §37) 



(4) s = \\J VQ - 2\/l, 

which is a number of the type mentioned in the criterion. Hence a regular pentagon 
can be constructed by ruler and compasses (see the example above quoted). 

31. Cubic Equations with a Constructible Root. We saw that the 
problem of the duplication of a cube led to a cubic equation (2). We shall later 
show that each of the problems, to trisect an angle, and to construct regular 
polygons of 7 and 9 sides with ruler and compasses, leads to a cubic equation. 
We shall be in a position to treat all of these problems as soon as we have 
proved the following general result. 

THEOREM. It is not possible to construct with ruler and compasses a line 
whose length is a root or the negative of a root of a cubic equation with rational 
coefficients having no rational root. 

Suppose that x\ is a root of 

(5) x 3 + ax 2 + fix + 7 = (a, fi, 7 rational) 

such that a line of length x\ or —x\ can be constructed with ruler and com- 
passes; we shall prove that one of the roots of (5) is rational. We have only to 
discuss the case in which x\ is irrational. 

By the criterion in §30, since the given numbers in this problem are a, fi, 7, 
all rational, x\ can be obtained by a finite number of rational operations and 
extractions of real square roots, performed upon rational numbers or numbers 
derived from them by such operations. Thus x\ involves one or more real 
square roots, but no further irrationalities. 

As in the case of (4), there may be superimposed radicals. Such a two-story 
radical which is not expressible as a rational function, with rational coefficients, 
of a finite number of square roots of positive rational numbers is said to be a 
radical of order 2. In general, an n-story radical is said to be of order n if it 
is not expressible as a rational function, with rational coefficients, of radicals 
each with fewer than n superimposed radicals, the innermost ones affecting 
positive rational numbers. 

We agree to simplify x\ by making all possible replacements of certain 
types that are sufficiently illustrated by the following numerical examples. 
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If xi involves a/3, V§, and \/l5, we agree to replace a/15 by a/3 • a/5. If 
xi = s — It, where s is given by (4) and 



t= \\Jl$ + 2Vh, 

so that st = a/5, we agree to write x\ in the form s — 7a/5/s, which involves a 
single radical of order 2 and no new radical of lower order. Finally, we agree 
to replace \/4 — 2a/3 by its simpler form a/3 — 1. 

After all possible simplifications of these types have been made, the result- 
ing expressions have the following properties (to be cited as our agreements): 
no one of the radicals of highest order n in x\ is equal to a rational function, 
with rational coefficients, of the remaining radicals of order n and the radicals 
of lower orders, while no one of the radicals of order n — 1 is equal to a ratio- 
nal function of the remaining radicals of order n — 1 and the radicals of lower 
orders, etc. 

Let Vk be a radical of highest order n in x\. Then 

a + bVk 

x\ = 7=, 

c + d\y k 

where a, b, c, d do not involve Vk, but may involve other radicals. If d = 0, 
then c / and we write e for a/c, f for b/c, and get 

(6) xi = e + fVk, (/ + 0) 



where neither e nor / involves v k. If d / 0, we derive (6) by multiplying the 
numerator and denominator of the fraction for x\ by c — dVk, which is not zero 
since Vk = c/d would contradict our above agreements. 

By hypothesis, (6) is a root of equation (5). After expanding the powers 
and replacing the square of \fk by k, we see that 



(7) (e + fVkf + a(e + fVkf + (3(e + fVk) + ~f = A + BVk, 

where A and B are certain polynomials in e, /, k and the rational numbers 
a, (3, 7. Thus A + B\fk = 0. If B ^ 0, \//c = — A/B is a rational function, 
with rational coefficients, of the radicals, other than y/k, in x\, contrary to our 
agreements. Hence B = and therefore A = 0. 

When e — fVk is substituted for x in the cubic function (5), the result is the 
left member of (7) with Vk replaced by —Vk, and hence the result is A — B\fk- 
But A = B = 0. This shows that 

(8) x 2 = e - fVk 
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is a new root of our cubic equation. Since the sum of the three roots is equal 
to —a by §20, the third root is 

(9) £3 = —a — x\ — X2 = —a — 2e. 

Now a is rational. If also e is rational, X3 is a rational root and we have 
reached our goal. We next make the assumption that e is irrational and show 
that it leads to a contradiction. Since e is a component part of the constructible 
root (6), its only irrationalities are square roots. Let \/sbe one of the radicals of 
highest order in e. By the argument which led to (6), we may write e = e'+f'y/s, 
whence, by (9), 

(9') x 3 = g + hyTs, (h ± 0) 

where neither g nor h involves \/s. Then by the argument which led to (8), 
g — h\/s is a root, different from X3, of our cubic equation, and hence is equal 
to x\ or %2 since there are only three roots (§16). Thus 

g — h\/s = e ± fVk. 

By definition, y/s is one of the radicals occurring in e. Also, by (9 ; ), every 
radical occurring in g or h occurs in X3 and hence in e = \{— a — 23), by (9), 
a being rational. Hence \/k is expressible rationally in terms of the remaining 
radicals occurring in e and /, and hence in x\, whose value is given by (6). 
But this contradicts one of our agreements. 

32. Trisection of an Angle. For a given angle A, we can construct with 
ruler and compasses a line of length cos A or — cos A, namely the adjacent leg 
of a right triangle, with hypotenuse unity, formed by dropping a perpendicular 
from a point in one side of A to the other, produced if necessary. If it were 
possible to trisect angle A, i.e., construct the angle A/3 with ruler and com- 
passes, we could as before construct a line whose length is ± cos(A/3). Hence if 
we show that this last cannot be done when the only given geometric elements 
are the angle A and a line of unit length, we shall have proved that the angle A 
cannot be trisected. We shall give the proof for A = 120°. 

We employ the trigonometric identity 

o A A 

cos A = 4 cos 3 cos — . 

3 3 

Multiply each term by 2 and write x for 2cos(A/3). Thus 

(10) x 3 - 3x = 2 cos A. 
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For A = 120°, cosA = ~l and (10) becomes 

(11) x 3 -3x+l = 0. 

Any rational root is an integer (§27) which is an exact divisor of the con- 
stant term (§24). By trial, neither +1 nor —1 is a root. Hence (11) has no 
rational root. Hence (§31) it is not possible to trisect all angles with ruler and 
compasses. 

Certain angles, like 90°, 180°, can be trisected. When A = 180°, the equation 
(10) becomes x 3 — 3x = —2 and has the rational root x = 1. It is the rationality of 
a root which accounts for the possibility of trisecting this special angle 180°. 

33. Regular Polygon of 9 Sides, Duplication of a Cube. Since angle 
120° cannot be trisected with ruler and compasses (§32), angle 40° cannot be 
so constructed in terms of angle 120° and the line of unit length as the given 
geometric elements. Since the former of these elements and its cosine are 
constructible when the latter is given, we may take the line of unit length as 
the only given element. In a regular polygon of 9 sides, the angle subtended 
at the center by one side is g • 360° = 40°. Hence a regular polygon of 9 sides 
cannot be constructed with ruler and compasses. Here, as in similar subsequent 
statements where the given elements are not specified, the only such element 
is the line of unit length. 

A rational root of x 3 = 2 is an integer (§27) which is an exact divisor 
of 2. The cubes of ±1 and ±2 are distinct from 2. Hence there is no rational 
root. Hence (§§30, 31) it is not possible to duplicate a cube with ruler and 
compasses. 

34. Regular Polygon of 7 Sides. If we could construct with ruler and 
compasses an angle 5 containing 360/7 degrees, we could so construct a line 
of length x = 2 cos 5. Since IB = 360°, cos 35 = cos 45. But 

2 cos 35 = 2(4 cos 3 B - 3 cos B) = x 3 - 3x, 

2 cos 45 = 2(2 cos 2 25 - 1) = 4(2 cos 2 5 - l) 2 - 2 = (x 2 - 2) 2 - 2. 

Hence 

= x 4 - 4x 2 + 2 - (x 3 - 3x) = (x- 2)(x 3 + x 2 - 2x - 1). 

But x = 2 would give cos 5 = 1, whereas 5 is acute. Hence 

(12) x 3 + x 2 -2x-l = 0. 

Since this has no rational root, it is impossible to construct a regular polygon 
of 7 sides with ruler and compasses. 
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35. Regular Polygon of 7 Sides and Roots of Unity. If 

2vr 2tt 

it = cos \- i sin — , 

7 7 

we saw in §10 that R, R 2 , R 3 , R 4 , R 5 , R 6 , R 7 = 1 give all the roots of y 7 = 1 
and are complex numbers represented by the vertices of a regular polygon 
of 7 sides inscribed in a circle of radius unity and center at the origin of 
coordinates. By §6, 
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We saw in §34 that 2 cos(27r/7) is one of the roots of the cubic equation (12). 
This equation can be derived in a new manner by utilizing the preceding 
remarks on 7th roots of unity. Our purpose is not primarily to derive (12) 
again, but to illustrate some principles necessary in the general theory of the 
construction of regular polygons. 

Removing from y 7 — 1 the factor y — 1, we get 

(13) y 6 + y 5 + y 4 + y 3 + y 2 + y + 1 = 0, 

whose roots are R, R 2 , . . . , i? 6 . Since we know that R + l/R is one of the roots 
of the cubic equation (12), it is a natural step to make the substitution 

/ N 1 

(14) y+-= X 

y 

in (13). After dividing its terms by y 3 , we have 



(13') (y 3 + ^ + y 2 + ^) + (y + -)+ 1 = - 



(15) y 2 + ^ = x 2 - 2, y 3 + -g = x 3 - 3x. 



y 3 J \ y 2 J \ y 

By squaring and cubing the members of (14), we see that 

1 

r y 

Substituting these values in (13 ; ), we obtain 

(12) x 3 + x 2 -2x-l = 0. 

That is, the substitution (14) converts equation (13) into (12). 

If in (14) we assign to y the six values R,...,R®, we obtain only three 
distinct values of x: 

(16) xi = R+- = R+R 6 , x 2 = R 2 + ^2 = i?2 + i?5 > ^3 = ^ 3 +^3 = # 3 +^ 4 - 
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In order to illustrate a general method of the theory of regular polygons, 
we start with the preceding sums of the six roots in pairs and find the cubic 
equation having these sums as its roots. For this purpose we need to calculate 

X\ + X2 + X3, X\X2 + 21X3 + X2X3, X\X2X'S. 

First, by (16), 



xi + x 2 + x 3 = R + R 2 + • • • + R 6 = -1, 



since R, . . . ,R b are the roots of (13). Similarly, 

£1£ 2 + xxx s + x 2 x 3 = 2(R + R 2 + • • • + R 6 ) = -2, 
£1x2x3 = 2 + R + R 2 + • • • + R 6 = 1. 

Consequently (§20), the cubic having x 1? x 2 , X3 as roots is (12). 

36. Reciprocal Equations. Any algebraic equation such that the recip- 
rocal of each root is itself a root of the same multiplicity is called a reciprocal 
equation. 

The equation y 7 — 1 = is a reciprocal equation, since if r is any root, 1/r 
is evidently also a root. Since (13) has the same roots as this equation, with the 
exception of unity which is its own reciprocal, (13) is also a reciprocal equation. 

If r is any root ^ of any equation 

f(y) ^y n + --- + c = 0, 
1/r is a root of /(1/y) = and hence of 

2/ n /Q)=l + --- + cy" = 0. 

If the former is a reciprocal equation, it has also the root 1/r, so that every 
root of the former is a root of the latter equation. Hence, by §18, the left 
member of the latter is identical with cf(y). Equating the constant terms, we 
have c 2 = 1, c = ±1. Hence 

(17) y n f Q) ^ ±/(y). 

Thus if Piy n ~ l is a term of f(y), also ±Piy l is a term. Hence 

(is') f(y) = y n ± 1 + Piiy 11 ' 1 ±v) + V2{y n ~ 2 ± y 2 ) + ■ ■ ■ ■ 

If n is odd, n = 2t + 1, the final term is pt(y t+1 ± y*), and y ± 1 is a factor 
of /(y). In view of (17), the quotient 

/(y) 



Q(y) 



y±l 
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has the property that 

y n ~ x Q Q) = Q(y). 

Comparing this with (17), which implied (18'), we see that Q(y) = is a 
reciprocal equation of the type 

(18) y 2t + 1 + ciG/ 2 *" 1 + y) + c 2 (y 2i - 2 + y 1 ) + • • • + Ci _i(y i+1 + y*" 1 ) + ay 1 = 0. 

If n is even, n = 2t, and if the upper sign holds in (17), then (18') is of the 
form (18). Next, let the lower sign hold in (17). Since a term pty 1 would imply a 
term —pty 1 , we have pt = 0. The final term in (18') is therefore pt-i(y t+1 — y t ~ 1 ). 
Hence f(y) has the factor y 2 — 1. The quotient q(y) = f(y)/(y 2 — 1) has the 
property that 

y n - 2 g(^j^q(y)- 

Comparing this with (17) as before, we see that q(y) = is of the form (18) 
where now 2t = n — 2. Hence, at least after removing one or both of the factors 
y ± 1, any reciprocal equation may be given the form (18). 

The method by which (13) was reduced to a cubic equation may be used 
to reduce any equation (18) to an equation in x of half the degree. First, we 
divide the terms of (18) by y l and obtain 

^ + ^)+c 1 (y- 1 + ^)+'-- + c f -i(y+£)+c i = 0. 

Next, we perform the substitution (14) by either of the following methods: We 
may make use of the relation 

y k y y k ~ l ) V V k ~ 2 

to compute the values of y k + l/y k in terms of x, starting with the special cases 
(14) and (15). For example, 

= x(x 3 - 3x) - (x 2 - 2) = x 4 - 4x 2 + 2. 

Or we may employ the explicit formula (19) of §107 for the sum y k + \jy k of 
the fcth powers of the roots y and 1/y of y 2 — xy + 1 = 0. 
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37. Regular Polygon of 9 Sides and Roots of Unity. If 

2vr 2tt 

R = cos — + i sin — , 

the powers R, R 2 , i? 4 , i? 5 , R 7 , i? 8 , are the primitive ninth roots of unity (§11). 
They are therefore the roots of 



(19) 



1 



y° + y A + 1 = 0. 



Dividing the terms of this reciprocal equation by y 3 and applying the second 
relation (15), we obtain our former cubic equation (11). 

EXERCISES 



1. Show by (16) that the roots of (12) are 2cos27r/7, 2cos47r/7, 2cos67r/7. 

2. The imaginary fifth roots of unity satisfy 
y 4 + y 3 + y 2 + y + 1 = 0, which by the substi- 
tution (14) becomes x 2 + x — 1 = 0. It has the 
root 



R + 



1 
R 



2 cos 



2tt 1 



(VE-i) 



In a circle of radius unity and center O draw two 
perpendicular diameters AOA', BOB' . With the 
middle point M of OA' as center and radius MB 
draw a circle cutting OA at C (Fig. 10). Show 
that OC and BC are the sides s\q and S5 of the 
inscribed regular decagon and pentagon respec- 
tively. Hints: 




MB = \\fb, 
sio = 2sinl8 c 



OC= §(\/5-l) 

= 2 cos — = OC, 

5 



s 5 2 = (2sin36°) 2 



2 1 



2tt 



cos 



5C = Vl + OC 2 = W 10 - 2^ 



-(10-2\/5), s 5 = BC. 



3. If i? is a root of (19) verify as at the end of §35 that R + i? 8 , i? 2 + R 7 , and 
i? 4 + i? 5 are the roots of (11). 

4. Hence show that the roots of (11) are 2cos27r/9, 2cos47t/9, 2cos87r/9. 

5. Reduce y 11 = 1 to an equation of degree 5 in x. 
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6. Solve y 5 — 7y 4 + y 3 — y 2 + ly — 1 = by radicals. [One root is 1.] 

7. After finding so easily in Chapter I the trigonometric forms of the complex 
roots of unity, why do we now go to so much additional trouble to find them alge- 
braically? 

8. Prove that every real root of x 4 + ax 2 + b = can be constructed with ruler 
and compasses, given lines of lengths a and b. 



9. Show that the real roots of x^—px—q = are the abscissas of the intersections 

2#j 2^2^ 



of the parabola y = x 2 and the circle through the origin with the center (^q, ^ + ^p). 



Prove that it is impossible, with ruler and compasses: 

10. To construct a straight line representing the distance from the circular base 
of a hemisphere to the parallel plane which bisects the hemisphere. 

11. To construct lines representing the lengths of the edges of an existing rect- 
angular parallelopiped having a diagonal of length 5, surface area 24, and volume 1. 
2, 3, or 5. 

12. To trisect an angle whose cosine is j, g, 4, g or p/q, where p and q (q > 1) 
are integers without a common factor, and q is not divisible by a cube. 

Prove algebraically that it is possible, with ruler and compasses: 

13. To trisect an angle whose cosine is (4a 3 — 3ab 2 )/b 3 , where the integer a is 
numerically less than the integer b; for example, cos -1 11/16 if a = —1, b = 4. 

14. To construct the legs of a right triangle, given its area and hypotenuse. 

15. To construct the third side of a triangle, given two sides and its area. 

16. To locate the point P on the side BC = 1 of a given square ABCD such that 
the straight line AP cuts DC produced at a point Q for which the length of PQ is 
a given number g. Show that y = BP is a root of a reciprocal quartic equation, and 
solve it when g = 10. 



38. The Periods of Roots of Unity. Before taking up the regular 
polygon of 17 sides, we first explain another method of finding the pairs of 
imaginary seventh roots of unity R and i? 6 , R 2 and i? 5 , i? 3 and R 4 , employed 
in (16). To this end we seek a positive integer g such that the six roots can be 
arranged in the order 

(20) R, BF, R 9 \ R 9 \ R 9 \ R 9 \ 
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where each term is the gth power of its predecessor. Trying g = 2, we find that 
the fourth term would then be i? 8 = R. Hence g ^ 2. Trying g = 3, we obtain 

(21) it, it , it , it , it , it , 

where each term is the cube of its predecessor. 
To define three periods, each of two terms, 

(16') R + R 6 , R 2 + R 5 , R 3 + R 4 , 

we select the first term R of (21) and the third term i? 6 after it and add them, 
then the second term i? 3 and the third term i? 4 after it, and finally R 2 and 
the third term R 5 after it. 

We may also define two periods, each of three terms, 

z\ = R + R 2 + R 4 , z 2 = R 3 + R 6 + R 5 , 

by taking alternate terms in (21). 

Since z\ + z 2 = — 1, z\Z2 = 3 + R + • • • + R 6 = 2, z\ and z<i are the roots of 
z 2 + z + 2 = 0. Then R, R 2 , R 4 are the roots of w 3 - z\w 2 + z 2 w -1 = 0. 

39. Regular Polygon of 17 Sides. Let R be a root ^ 1 of x 17 = 1. 
Then 

:^_I = ijie + fli5 + . . . + R + i = o. 
it — 1 

As in §38, we may take g = 3 and arrange the roots R, . . . , i? 16 so that each is 
the cube of its predecessor: 

P P 3 R 9 P 10 P 13 R 5 R 15 P 11 P 16 R 14 R 8 R 7 R 4 R 12 R 2 R 6 
.ft, it , it , it , it , it , It i it , -ft ,-ft , it , it , it , it , it , it . 

Taking alternate terms, we get the two periods, each of eight terms, 

yi = R + R 9 + i? 13 + i? 15 + i? 16 + i? 8 + R 4 + i? 2 , 

y 2 = R 3 + R 10 + i? 5 + i? 11 + i? 14 + R 7 + i? 12 + i? 6 . 

Hence yi + y 2 = -1. We find that yiy 2 = 4(i? H h i? 16 ) = -4. Thus 

(22) yi, y 2 satisfy y 2 + y - 4 = 0. 
Taking alternate terms in yi, we obtain the two periods 

Zl = R + R 13 + i? 16 + i? 4 , z 2 = i? 9 + i? 15 + i? 8 + i? 2 . 
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Taking alternate terms in y 2 , we get the two periods 

W1 = R 3 + R 5 + R u + R 12 , w 2 = R 10 + R 11 + R 7 + R 6 . 
Thus z\ + Z2 = j/i, wi + w 2 = 2/2- We find that z\Z2 = W1W2 = — 1. Hence 

(23) zi, Z2 satisfy z 2 — y\z — 1 = 0, 

(24) w\, W2 satisfy w — y2W — 1 = 0. 

Taking alternate terms in z\, we obtain the periods 
v 1 = R + R 16 , v 2 = R 13 + R 4 . 

Now, v\ + V2 = z\, v\V2 = ifi- Hence 

(25) vi, V2 satisfy v 2 — z\v + w\ = 0, 

(26) i?, i? 16 satisfy p 2 - v x p + 1 = 0. 

Hence we can find R by solving a series of quadratic equations. Which of 
the sixteen values of R we shall thus obtain depends upon which root of (22) 
is called y\ and which y2, and similarly in (23)-(26). We shall now show what 
choice is to be made in each such case in order that we shall finally get the 
value of the particular root 



2tt 2tt 

it = cos \- i sin — . 

17 17 



Then 



1 2tt 2tt 1 2tt 

— = cos i sin — , v-t = R + — =2 cos — , 

i? 17 17 i? 17 

r.4 8tt . . 8tt „ 4 1 8tt 

it = cos h « sin — , vo = R H — r = 2 cos — . 

17 17 2 R A 17 

Hence v\ > i>2 > 0, and therefore z\ = v\ + V2 > 0. Similarly, 

„q 1 t 1 67T 107T 67T 77T 

ioi = -R H o + i? H f = 2 cos h 2 cos — — = 2 cos 2 cos — - > 0, 

i? 3 i? 5 17 17 17 17 

6tt 10tt 12tt 14tt 

yo = 2 cos h 2 cos \- 2 cos h 2 cos < 0, 

yz 17 17 17 17 

since only the first cosine in j/2 is positive and it is numerically less than the 
third. But y 1 y 2 =- -4. Hence y 1 > 0. Thus (22)-(24) give 

2/1 



1(^17-1), y 2 = ^(-Vl7-l 



^1 = 22/1 + V 1 + i2/i, wi = 22/2 + V 1 + \v\- 
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We may readily construct segments of these lengths. Evidently a/17 is the 
length of the hypotenuse of a right triangle whose legs are of lengths 1 and 4, 
while for the radical in z\ we employ legs of lengths 1 and \y\. We thus obtain 
segments representing the coefficients of the quadratic equation (25). Its roots 
may be constructed as in §29. The larger root is 



v\ = 2 cos 



2tt 
17' 



Hence we can construct angle 27r/17 with ruler and compasses, and therefore 
a regular polygon of 17 sides. 



40. Construction of a Regular Polygon of 17 Sides. In a circle 
of radius unity, construct two perpendicular diameters AB, CD, and draw 
tangents at A, D, which intersect at S (Fig. 11). Find the point E in AS for 
which AE = \AS, by means of two bisections. Then 



AE 



i 

4< 



OE 



17. 



Let the circle with center E and radius OE cut AS at F and F'. Then 

AF = EF - EA = OE - \ = \ yi , 
AF' = EF' + EA = OE+l = -\y 2 , 



OF=y/OA2 + AF* = Jl + hl OF' = Jl + \yl 



m- 



Let the circle with center F and radius FO cut AS at H, outside of F'F; that 
with center F' and radius F'O cut AS at H' between F' and F. Then 



AH = AF + FH = AF + OF= \ Vl + yjl + \y\ = Zl , 
AH' = F'H' - F'A = OF' - AF' = Wl . 
B 




F' S 
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It remains to construct the roots of equation (25). This will be done as 
in §29. Draw HTQ parallel to AO and intersecting OC produced at T. Make 
TQ = AH'. Draw a circle having as diameter the line BQ joining B = (0, 1) 
with Q = {z\,w\). The abscissas ON and OM of the intersections of this 
circle with the x-axis OT are the roots of (25). Hence the larger root v\ is 
OM = 2 cos(2tt/17). 

Let the perpendicular bisector LP of OM cut the initial circle of unit radius 

at P. Then 

cos LOP = OL = cos—, LOP = —. 

17 17 

Hence the chord CP is a side of the inscribed regular polygon of 17 sides, 
constructed with ruler and compasses. 

41. Regular Polygon of n Sides. If n be a prime such that n — 1 is 
a power 2 h of 2 (as is the case when n — 3, 5, 17), the n — 1 imaginary nth 
roots of unity can be separated into 2 sets each of 2 ft_1 roots, each of these 
sets subdivided into 2 sets each of 2 h ~ 2 roots, etc., until we reach the pairs R, 
1/R and R 2 , 1/i? 2 , etc., and in fact 1 in such a manner that we have a series of 
quadratic equations, the coefficients of any one of which depend only upon the 
roots of quadratic equations preceding it in the series. Note that this was the 
case for n = 17 and for n = 5. It is in this manner that it can be proved that 
the roots of x n = 1 can be found in terms of square roots, so that a regular 
polygon of n sides can be inscribed by ruler and compasses, provided n be a 
prime of the form 2^ + 1. 

If n be a product of distinct primes of this form, or 2 k times such a product 
(for example, n = 15, 30 or 6), or if n = 2 m (m > l), it follows readily (see 
Ex. 1 below) that we can inscribe with ruler and compasses a regular polygon 
of n sides. But this is impossible for all other values of n. 



EXERCISES 

1. If a and b are relatively prime numbers, so that their greatest common divisor 
is unity, we can find integers c and d such that ac-\- bd = 1. Show that, if regular 
polygons of a and b sides can be constructed and hence angles 2-k/o, and 2ir/b, a 
regular polygon of a ■ b sides can be derived. 

2. If p = 2 + 1 is a prime, h is a power of 2. For h = 2°, 2 1 , 2 2 , 2 3 , the values 
of p are 3, 5, 17, 257 and are primes. [Show that h cannot have an odd factor other 
than unity] 



"^ee the author's article "Constructions with ruler and compasses; regular polygons," in 
Monographs on Topics of Modern Mathematics, Longmans, Green and Co., 1911, p. 374. 
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3. For 13th roots of unity find the least g (§38), write out the three periods each 
of four terms, and find the cubic equation having them as roots. 

4. For the primitive ninth roots of unity find the least g and write out the three 
periods each of two terms. 

Solve the following reciprocal equations: 

5. y 4 + 4y 3 - 3y 2 + 4y + 1 = 0. 6. y 5 - 4y 4 + y 3 + y 2 - Ay + 1 = 0. 
7. 2y 6 -5y 5 + 4y 4 -4y 2 + 5y-2 = 0. 8. y 5 + 1 = 31(y + l) 5 . 



CHAPTER IV 

Solution of Cubic and Quartic Equations; Their Discriminants 

42. Reduced Cubic Equation. If, in the general cubic equation 

(1) x 3 + bx 2 + ex + d = 0, 

we set x = y — b/3, we obtain the reduced cubic equation 

(2) y : i +py + q = o, 
lacking the square of the unknown y, where 

, N b 2 , be 2fc 3 

(3) p = c ~r q = d -J + ^7- 

After finding the roots y l7 y2, y% of (2), we shall know the roots of (1): 

(A\ b b b 

(4) xi = yi--, x 2 = y2-^> x 3 = y s - -. 

43. Algebraic Solution of the Reduced Cubic Equation. We shall 
employ the method which is essentially the same as that given by Vieta in 1591. 
We make the substitution 

(5) y = z ~h 

in (2) and obtain 

since the terms in z cancel, and likewise the terms in 1/z. Thus 
(6) / + gz 3_|! =0 . 

Solving this as a quadratic equation for z 3 , we obtain 

(7) j-i±vr, Mi) s +r 
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By §8, any number has three cube roots, two of which are the products of 
the remaining one by the imaginary cube roots of unity: 

(8) u = -\ + \\/3i, u? = -\ - IVZi. 

We can choose particular cube roots 



(9) A=^- q - + VR, B=^J- q --VR, 

such that AB = —p/3, since the product of the numbers under the cube root 
radicals is equal to (—p/3) 3 . Hence the six values of z are 

A, ujA, uj 2 A, B, ojB, u?B. 

These can be paired so that the product of the two in each pair is — p/3: 

AB = - V -, ujA-uj 2 B = -^, uj 2 A-coB = -^. 

3 3 3 

Hence with any root z is paired a root equal to —p/(3z). By (5), the sum of 
the two is a value of y. Hence the three values of y are 

(10) y 1 = A + B, y 2 = LoA + uj 2 B, y- A = uj 2 A + ujB. 

It is easy to verify that these numbers are actually roots of (2). For example, 
since w 3 = 1, the cube of y 2 is 

A 3 + B' 3 + 3luA 2 B + 3lo 2 AB 2 = -q - p(ujA + uj 2 B) = -q - py 2 , 

by (9) and AB = -p/3. 

The numbers (10) are known as Cardan's formulas for the roots of a reduced 
cubic equation (2). The expression A + B for a root was first published by 
Cardan in his Ars Magna of 1545, although he had obtained it from Tartaglia 
under promise of secrecy 

Example. Solve y 3 - 15y — 126 = 0. 

Solution. The substitution (5) is here y = z + 5/z. We get 

z 6 - 126z 3 + 125 = 0, z 3 = 1 or 125. 

The pairs of values of z whose product is 5 are 1 and 5, lo and 5u> 2 , uj 2 and 5a; . Their 
sums 6, uj + 5uj 2 , and uj 2 + 5lo give the three roots. 

EXERCISES 

Solve the equations: 
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1. y 3 - 18y + 35 = 0. 2. x 3 + 6x 2 + 3x + 18 = 0. 

3. y 3 - 2y + 4 = 0. 4. 28x 3 + 9x 2 - 1 = 0. 



44. Discriminant. The product of the squares of the differences of the 
roots of any equation in which the coefficient of the highest power of the 
unknown is unity shall be called the discriminant of the equation. For the 
reduced cubic (2), the discriminant is 

(11) (3/1 " V2) 2 (yi - Vi) 2 {V2 - ysf = -# 3 - 27q 2 , 

a result which should be memorized in view of its important applications. It 
is proved by means of (10) and ufi = 1, to 2 + u + 1 = 0, as follows: 

VI ~ V2 = (1 - w)(A - uj 2 B), yi - i/3 = (1 - ^ 2 ){A - uB), 

y 2 -y:i = (u-uj 2 )(A-B), 
(1 - lu)(1 - uj 2 ) = 3, uj-cu 2 = V3i. 
Since 1, u, u 2 are the cube roots of unity, 

(x — l)(x — uj)(x — UJ ) = X — 1, 
identically in x. Taking x = A/B, we see that 

(A -B)(A- loB)(A - uj 2 B) = A* - B 3 = 2VR, 

by (9). Hence 

(vi - V2){yi - y:i){y2 - 2/3) = QVs^/Ri. 
Squaring, we get (11), since — 108i? = — 4p 3 — 27q 2 by (7). For later use, we note 
that the discriminant of the reduced cubic is equal to — 108R. 

The discriminant A of the general cubic (1) is equal to the discriminant of 
the corresponding reduced cubic (2). For, by (4), 

xi - %2 = yi ~ 2/2, xi - x 3 = yi - ys, x 2 - x 3 = y 2 - ys- 

Inserting in (11) the values of p and q given by (3), we get 

(12) A = 18bcd - 4b 3 d + b 2 c 2 - 4c 3 - 27d 2 . 
It is sometimes convenient to employ a cubic equation 

(13) ax 3 + bx 2 + ex + d = (a / 0), 

in which the coefficient of x 3 has not been made unity by division. The product P of 
the squares of the differences of its roots is evidently derived from (12) by replacing 
b, c, d by b/a, c/a, d/a. Hence 

(14) a 4 P = 18abcd - Ab 3 d + b 2 c 2 - 4ac 3 - 27a 2 d 2 . 
This expression (and not P itself) is called the discriminant of (13). 
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45. Number of Real Roots of a Cubic Equation. A cubic equation 
with real coefficients has three distinct real roots if its discriminant A is posi- 
tive, a single real root and two conjugate imaginary roots if A is negative, and 
at least two equal real roots if A is zero. 

If the roots xi, X2, £3 are all real and distinct, the square of the difference 
of any two is positive and hence A is positive. 

If x\ and X2 are conjugate imaginaries and hence X3 is real (§21), (x\ — X2) 2 
is negative. Since x\ — X3 and £2 — £3 are conjugate imaginaries, their product 
is positive. Hence A is negative. 

If x\ = X2, A is zero. If X2 were imaginary, its conjugate would be equal 
to X3 by §21, and X2, £3 would be the roots of a real quadratic equation. 
The remaining factor x — x\ of the cubic would have real coefficients, whereas 
x\ = X2 is imaginary. Hence the equal roots must be real. 

Our theorem now follows from these three results by formal logic. For 
example, if A is positive, the roots are all real and distinct, since otherwise 
either two would be imaginary and A would be negative, or two would be 
equal and A would be zero. 



EXERCISES 

Compute the discriminant and find the number of real roots of 
1. y 3 - 2y - 4 = 0. 2. y 3 - 15y + 4 = 0. 

3. y 3 - 27y + 54 = 0. 4. x 3 + Ax 2 - lis + 6 = 0. 

5. Show by means of §21 that a double root of a real cubic is real. 



46. Irreducible Case. When the roots of a real cubic equation are all 
real and distinct, the discriminant A is positive and R = —A/108 is negative, 
so that Cardan's formulas present the values of the roots in a form involving 
cube roots of imaginaries. This is called the irreducible case since it may be 
shown that a cube root of a general complex number cannot be expressed in 
the form a + bi, where a and b involve only real radicals. 1 While we cannot 
always find these cube roots algebraically, we have learned how to find them 
trigonometrically (§8). 

Example. Solve the cubic equation (2) when p = —12, q = — 8v2- 



Author's Elementary Theory of Equations, pp. 35, 36. 
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Solution. By (7), R = — 32 Hence formulas (9) become 



A = y W2 + 4\/2i, 5 = y4V2 - 4\/2t. 

The values of .A were found in §8. The values of -E? are evidently the conjugate 
imaginaries of the values of A. Hence the roots are 

4 cos 15°, 4 cos 135°, 4 cos 255°. 



EXERCISES 

1. Solve y 3 - 15y + 4 = 0. 2. Solve y 3 - 2y - 1 = 0. 

3. Solve y 3 - ly + 7 = 0. 4. Solve x 3 + 3x 2 - 2x - 5 = 0. 

5. Solve x 3 + x 2 -2x-l = 0. 6. Solve x 3 + 4x 2 - 7 = 0. 



47. Trigonometric Solution of a Cubic Equation with A > 0. When 
the roots of a real cubic equation are all real, i.e., if R is negative, they can be 
computed simultaneously by means of a table of cosines with much less labor 
than required by Cardan's formulas. To this end we write the trigonometric 
identity 



cos 3^4 = 4 cos 3 A - 3 cos A 



in the form 



.3 3, 



i'-^z—^cos3A = (z = cosA). 
In the given cubic y 3 + py + q = take y = nz; then 

z*+Pz + «=0, 
which will be identical with the former equation in z if 



n 



fp, cos 3,4= -\q^rJ-p i /27. 



Since R = p 3 /27 + q 2 /4 is negative, p must be negative, so that n is real and 
the value of cos 3A is real and numerically less than unity. Hence we can find 
3 A from a table of cosines. The three values of z are then 

cos.4, cos(.4 + 120 o ), cos(y4 + 240°). 

Multiplying these by n, we obtain the three roots y correct to a number of 
decimal places which depends on the tables used. 
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EXERCISES 

1. For y 3 - 2y - 1 = 0, show that n 2 = 8/3, cos3A = ^27/32, 3A = 23°17'0 // , 
cos A = 0.99084, cos(,4 + 120°) = -0.61237, cos(^ + 240°) = -0.37847, and that 
the roots y are 1.61804, -1, -0.61804. 

2. Solve Exs. 1, 3, 4, 5, 6 of §46 by trigonometry. 

48. Ferrari's Solution of the Quartic Equation. The general quartic 
equation 

(15) x 4 + bx 3 + ex 2 + dx + e = 0, 

or equation of degree four, becomes after transposition of terms 

x + bx = —ex — dx — e. 

The left member contains two of the terms of the square of x 2 + \bx. Hence 
by completing the square, we get 

{x 2 + \bx) 2 = (lb 2 - c)x 2 -dx-e. 

Adding (x 2 + \bx)y + \y 2 to each member, we obtain 

(16) (x 2 + \bx + \yf = (lb 2 - c + y)x 2 + (\by - d)x + \y 2 - e. 

The second member is a perfect square of a linear function of x if and only if 
its discriminant is zero (§12): 

(\by - d) 2 - A(\b 2 - c + y)(\y 2 - e) = 0, 

which may be written in the form 

(17) y 3 - cy 2 + (bd - 4e)y - b 2 e + 4ce - d 2 = 0. 

Choose any root y of this resolvent cubic equation (17). Then the right 
member of (16) is the square of a linear function, say mx + n. Thus 



(18) x + \bx + \y = mx + n or x + \bx + \y 



-mx — n. 



The roots of these quadratic equations are the four roots of (16) and hence 
of the equivalent equation (15). This method of solution is due to Ferrari 
(1522-1565). 

Example. Solve x 4 + 2x 3 - Ylx 1 - lOx + 3 = 0. 
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Solution. Here b = 2, c = — 12, d = — 10, e = 3. Hence (17) becomes 

y 3 + 12y 2 - 32y - 256 = 0, 

which by Ex. 2 of §24 has the root y = —4. Our quartic may be written in the form 

(x 2 + x ) 2 = 13a; 2 + lOx - 3. 

Adding (x 2 + x)(— 4) + 4 to each member, we get 

(x 2 + x - 2) 2 = 9x 2 + 6x + 1 = (3x + l) 2 , 
x 2 + x - 2 = ±(3x + 1), x 2 - 2x - 3 = or x 2 + 4x - 1 = 0, 

whose roots are 3, — 1, —2 ± s/E. As a check, note that the sum of the roots is —2. 



EXERCISES 

1. Solve x 4 - 8x 3 + 9x 2 + 8x - 10 = 0. Note that (17) is (y - 9)(y 2 - 24) = 0. 

2. Solve x 4 — 2x 3 — 7x 2 + 8x + 12 = 0. Since the right member of (16) is 
- y)(x 2 - x) + ly 2 - 12, use y = -8. 

3. Solve x 4 - 3x 2 + 6x - 2 = 0. 

4. Solve x 4 - 2x 2 - 8x - 3 = 0. 

5. Solve x 4 - 10x 2 - 20x - 16 = 0. 



49. Roots of the Resolvent Cubic Equation. Let y\ be the root y 
which was employed in §48. Let x\ and X2 be the roots of the first quadratic 
equation (18), and X3 and X4 the roots of the second. Then 

x\X2 = \yi-n, x 3 X4 = ^yi + n, xix 2 + x 3 x 4 = y\. 

If, instead of yi, another root 2/2 or 2/3 of the resolvent cubic (17) had been 
employed in §48, quadratic equations different from (18) would have been 
obtained, such, however, that their four roots are xi, x 2 , £3, X4, paired in a 
new manner. The root which is paired with x\ is x 2 or X3 or X4. It is now 
plausible that the values of the three y's are 

(19) yi = xix 2 + X3X4, y 2 = X1X3 + x 2 x 4 , y 3 = X1X4 + x 2 x 3 . 
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To give a more formal proof that the y's given by (19) are the roots of (17), 
we employ (§20) 

x\ + X2 + x-,>, + x^ = -b, x\X2X'i + :riX2£4 + X1X3X4 + X2X3X4 = -d, 

x\X2 + X1X3 + X1X4 + X2X3 + X2X4 + X3X4 = c, X1X2X3X4 = e. 

From these four relations we conclude that 

2/1 + 2/2 + 2/3 = c, 

2/12/2 + 2/12/3 + 2/22/3 = (xi + x 2 + x 3 + X4)(x!X2X 3 -\ \- x 2 x 3 x 4 ) - 4xix 2 x 3 x 4 

= bd- 4e, 

2/12/22/3 = (^l^^ H ) 2 + xix 2 x 3 x 4 {(xi H ) 2 - 4(x x x 2 H )} 

= d 2 + e(6 2 -4c). 

Hence (§20) j/i, 2/2, 2/3 are the roots of the cubic equation (17). 

50. Discriminant. The discriminant A of the quartic equation (15) is 
defined to be the product of the squares of the differences of its roots: 



A = (xi - x 2 ) (xi - x 3 ) (xi - x 4 ) (x 2 - x 3 ) (x 2 - x 4 ) (x 3 - x 4 ) . 

The fact that A is equal to the discriminant of the resolvent cubic equa- 
tion (17) follows at once from (19), by which 

2/1 - 2/2 = (xi ~ X4)(x 2 - x 3 ), 2/1 - 2/3 = (xi ~ xs)(x2 - x 4 ), 

2/2 - 2/3 = (xi ~ x 2 )(x3 - x 4 ), (yi - y 2 ) 2 (2/l - 2/3) 2 (2/2 - V'i) 2 = A. 

Hence (§44) A is equal to the discriminant — 4p 3 — 27q 2 of the reduced cubic 
Y 3 + pY + q = 0, obtained from (17) by setting y = Y + c/3. Thus 

(20) p = bd-4e-^c 2 , q = -b 2 e + \bcd+ fee - d 2 - ^c 3 . 

THEOREM. The discriminant of any quartic equation (15) is equal to the 
discriminant of its resolvent cubic equation and therefore is equal to the dis- 
criminant — 4p 3 — 27q 2 of the corresponding reduced cubic Y 3 + pY + q = 0, 
whose coefficients have the values (20). 
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EXERCISES 

1 . Find the discriminant of x 4 — 3x 3 + x 2 + 3x — 2 = and show that the equation 
has a multiple root. 

2. Show by its discriminant that x 4 — 8x 3 + 22x 2 — 24x + 9 = has a multiple 
root. 

3. If a real quartic equation has two pairs of conjugate imaginary roots, show 
that its discriminant A is positive. Hence prove that, if A < 0, there are exactly 
two real roots. 

4. Hence show that x 4 — 3x 3 + 3x 2 — 3x + 2 = has two real and two imaginary 
roots. 

51. Descartes' Solution of the Quartic Equation. Replacing x by 
z — 6/4 in the general quartic (15), we obtain the reduced quartic equation 

(21) z A + qz 2 + rz + s = 0, 

lacking the term with z 3 . We shall prove that we can express the left member 
of (21) as the product of two quadratic factors 

(z 2 + 2kz + l)(z 2 - 2kz + to) = z 4 + (I + m - Ak 2 )z 2 + 2k(m - l)z + Im. 

The conditions are 

I + m — 4k = q, 2k(m — I) = r, Im = s. 

If k 7^ 0, the first two give 

2l = q + 4k 2 , 2m = q + 4k 2 ^ . 

2k 2k 

Inserting these values in 21 ■ 2m = 4s, we obtain 

(22) 64k 6 + 32qk 4 + 4(q 2 - 4s)k 2 - r 2 = 0. 

The latter may be solved as a cubic equation for k 2 . Any root k 2 / gives a 
pair of quadratic factors of (21): 

(23) z 2 ±2kz+ ±q + 2k 2 T — ■ 

z 4k 

The four roots of these two quadratic functions are the four roots of (21). This 
method of Descartes (1596-1650) therefore succeeds unless every root of (22) 
is zero, whence q = s = r = 0, so that (12) is the trivial equation z 4 = 0. 
For example, consider z 4 — 3z 2 + Qz — 2 = 0. Then (22) becomes 

64k 6 - 3 ■ 32k 4 + 4 ■ 17k 2 - 36 = 0. 

The value k 2 = 1 gives the factors z 2 + 2z — 1, z 2 — 2z + 2. Equating these to zero, 
we hnd the four roots — 1 ± V2, 1 ± \f—\. 
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52. Symmetrical Form of Descartes' Solution. To obtain this sym- 
metrical form, we use all three roots kf, k%, &§ of (22). Then 

k{ + kl + kl = -\q, k\k\k\= r -. 

It is at our choice as to which square root of k\ is denoted by -\-k\ and which 
by — fci, and likewise as to ±k2, ±&3- For our purposes any choice of these 
signs is suitable provided the choice give 

(24) hk 2 h = - r -. 

Let k\ / 0. The quadratic function (23) is zero for k = k\ if 

Hence the four roots of the quartic equation (21) are 

(25) k\ + &2 + &3> fci — fe2 - &3, -fci + k 2 - ks, -k\ - ki + £3. 



EXERCISES 

1. Solve Exs. 4, 5 of §48 by the method of Descartes. 

2. By writing y\, 1/2, 2/3 for the roots k 2 , k%, k 2 of 

(26) 64y 3 + 32gy 2 + 4(q 2 - As)y - r 2 = 0, 
show that the four roots of (21) are the values of 

(27) z = y/yi + ^2/2 + Vvs 

for all combinations of the square roots for which 

r 

(28) Vm- Vm- Vm = ~v- 



3. Euler (1707 1783) solved (21) by assuming that it has a root of the form (27). 
Square (27), transpose the terms free of radicals, square again, replace the last factor 
°f 8^/^1^22/3 (\/Vi + \/y~2 J r \/y~3) by z, and identify the resulting quartic in z with (21) . 
Show that 2/1, 2/2, 2/3 are the roots of (26) and that relation (28) holds. 

4. Find the six differences of the roots (25) and verify that the discriminant A 
of (21) is equal to the quotient of the discriminant of (26) by 4 6 . 
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5. In the theory of the inflexion points of a plane cubic curve there occurs the 
equation 

z 4 - Sz 2 - \Tz - ^S 2 = 0. 

Show that (26) now becomes 

S\ 3 „ „ (T\ 2 /5 x3 



*-*) = c - c -ys 

and that the roots of the quartic equation are 



where uj is an imaginary cube root of unity and the signs are to be chosen so that 
the product of the three summands is equal to +\T. 



MISCELLANEOUS EXERCISES 

1. Find the coordinates of the single real point of intersection of the parabola 
y = x 2 and the hyperbola xy — Ax + y + 6 = 0. 

2. Show that the abscissas of the points of intersection of y = x 2 and ax 2 — xy + 
y 2 — x — (a + h)y — 6 = are the roots of x 4 — x 3 — 5x 2 — x — 6 = 0. Compute the 
discriminant of the latter and show that only two of the four points of intersection 
are real. 

3. Find the coordinates of the two real points in Ex. 2. 

4. A right prism of height h has a square base whose side is b and whose diagonal 
is therefore b\/2. If v denotes the volume and d a diagonal of the prism, v = hb 2 
and d 2 = h 2 + (&V2) 2 - Multiply the last equation by h and replace hb 2 by v. Hence 
h s — d 2 h + 2v = 0. Its discriminant is zero if d = 3v3, v = 27; find h. 

5. Find the admissible values of h in Ex. 4 when d = 12, v = 332.5. 

6. Find a necessary and sufficient condition that quartic equation (15) shall 
have one root the negative of another root. 

Hint: {x\ + X2){xz + X4) = q — y\. Hence substitute q for y in (17). 

7. In the study of parabolic orbits occurs the equation 

tan T^v + i tan ^v = t. 
Prove that there is a single real root and that it has the same sign as t. 

8. In the problem of three astronomical bodies occurs the equation x^ + ax + 2 = 
0. Prove that it has three real roots if and only if a ^ —3. 



CHAPTER V 

The Graph of an Equation 

53. Use of Graphs in the Theory of Equations. To find geometri- 
cally the real roots of a real equation f(x) = 0, we construct a graph of y = f(x) 
and measure the distances from the origin O to the intersections of the graph 
and the x-axis, whose equation is y = 0. 

For example to find geometrically the 
real roots of ("M)* Y ^ 7 ' 4 ) 



(1) 



Qx - 3 = 0, 



we equate the left member to y and make a 
graph of 

2 



(l' 



y 



6x — 3. 



We obtain the parabola in Fig. 12. Of the 
points shown, P has the abscissa x = OQ = 4 
and the ordinate y = —QP = — 11. From the 
points of intersection of y = (the x-axis 
OX) with the parabola, we obtain the ap- 
proximate values 6.46 and —0.46 of the roots 
of(l). 

EXERCISES 



(0,-3) 




1. Find graphically the real roots of x 2 — 6x + 7 = 0. 

Hint: For each x, y = x 2 — 6x + 7 exceeds the y in (F) by 10, so that the new 
graph is obtained by shifting the parabola in Fig. 12 upward 10 units, leaving the 
axes OX and OY unchanged. What amounts to the same thing, but is simpler to 
do, we leave the parabola and OY unchanged, and move the axis OX downward 
10 units. 

2. Discuss graphically the reality of the roots of x 2 — 6x + 12 = 0. 

3. Find graphically the roots of x 2 — 6x + 9 = 0. 
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(2) 



54. Caution in Plotting. If the example set were 



y = 8x 4 - 14x 3 - 9x 2 + llx - 2, 



one might use successive integral values of x, obtain the 
points (-2,180), (-1,0), (0,-2), (1,-6), (2,0), (3,220), all 
but the first and last of which are shown (by crosses) in 
Fig. 13, and be tempted to conclude that the graph is a 
U-shaped curve approximately like that in Fig. 12 and that 
there are just two real roots, —1 and 2, of 



(2') 



8x 4 



Ux c 



9x z + llx- 2 = 0. 



But both of these conclusions would be false. In fact, the 
graph is a W-shaped curve (Fig. 13) and the additional 
real roots are \ and \. 

This example shows that it is often necessary to employ 

also values of x which are not integers. The purpose of 

the example was, however, not to point out this obvious 

fact, but rather to emphasize the chance of serious error 

in sketching a curve through a number of points, however 

numerous. The true curve between two points below the x-axis may not cross 

the x-axis, or may have a peak and actually cross the x-axis twice, or may be 

an M-shaped curve crossing it four times, etc. 

For example, the graph (Fig. 14) of 



Fig. 13 



(3) 



x 3 + 4x 2 



11 



crosses the x-axis only once; but this fact cannot be 
established by a graph located by a number of points, 
however numerous, whose abscissas are chosen at ran- 
dom. 

We shall find that correct conclusions regarding the 
number of real roots may be deduced from a graph 
whose bend points (§55) have been located. 

55. Bend Points. A point (like M or M' in 
Fig. 14) is called a bend point of the graph of y = f(x) 
if the tangent to the graph at that point is horizontal 
and if all of the adjacent points of the graph lie below 
the tangent or all above the tangent. The first, but 
not the second, condition is satisfied by the point O of 




Fig. 14 



555. 



BEND POINTS 
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O 



X 




Fig. 15 



Fig. 16 



the graph of y 



given in Fig. 15 (see §57). In the 



language of the calculus, f(x) has a (relative) maximum or minimum value at 
the abscissa of a bend point on the graph of y = f(x). 

Let P = (x, y) and Q = (x + h, Y) be two points on the graph, sketched in 
Fig. 16, of y = f(x). By the slope of a straight line is meant the tangent of the 
angle between the line and the x-axis, measured counter-clockwise from the 
latter. In Fig. 16, the slope of the straight line PQ is 



(4) 



Y-y f(x + h)-f(x) 



h h 

For equation (3), f(x 

f(x + h) = (x + h) 3 + 4(x + h) 2 



x i _|_ 4x 2 _ -q_ Hence 



11 



= x 3 + Ax 2 - 11 + (3x 2 + 8x)h + (3s + 4)/i 2 + /i 3 . 
The slope (4) of the secant PQ is therefore here 

3x 2 + 8x + (3x + 4)h + h 2 . 

Now let the point Q move along the graph toward P. Then h approaches the 
value zero and the secant PQ approaches the tangent at P. The slope of the 
tangent at P is therefore the corresponding limit 3x 2 + 8x of the preceding 
expression. We call 3x 2 + 8x the derivative of x 3, + Ax 2 — 11. 

In particular, if P is a bend point, the slope of the (horizontal) tangent 
at P is zero, whence 3x 2 + 8x = 0, i = or a; = — |. Equation (3) gives the 
corresponding values of y. The resulting points 



M = (0,-11) 



M' = (- 



3' 



.41^ 

27/ 
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are easily shown to be bend points. Indeed, for x > and for x between —4 
and 0, x 2 (x + 4) is positive, and hence /(x) > — 11 for such values of x, so that 
the function (3) has a relative minimum at x = 0. Similarly, there is a relative 
maximum at x = — |. We may also employ the general method of §59 to show 
that M and M' are bend points. Since these bend points are both below the 
x-axis we are now certain that the graph crosses the x-axis only once. 

The use of the bend points insures greater accuracy to the graph than the 
use of dozens of points whose abscissas are taken at random. 

56. Derivatives. We shall now find the slope of the tangent to the graph 
of y = f(x), where /(x) is any polynomial 

(5) f(x) = aQX n + a\x n ~ -\ \- a n _ 1 x + a n . 

We need the expansion of f(x + h) in powers of x. By the binomial theorem, 

a (x + h) n = a x n + na x n ~ 1 h + — y — -a x n ~ 2 h 2 -\ , 



ai(x + h) 71 ' 1 = oix 71 " 1 + (n - l)aix n - 2 h + ^ ^ 2) aix n ~ 3 /i 2 + • • • , 

a n _ 2 (x + h) 2 = a n _ 2 x 2 + 2a n _ 2 x/i + a n _ 2 /i 2 , 
a n -l(x + h) = a n -\x + a n -\h, 
a n = a n . 

The sum of the left members is evidently f(x + h). On the right, the sum of 

the first terms (i.e., those free of h) is f(x). The sum of the coefficients of h 

is denoted by f'(x), the sum of the coefficients of \f? is denoted by f"(x), . . . , 

the sum of the coefficients of 

h k 

l-2---k 
is denoted by f( k >(x). Thus 

(6) f(x) = naQX n ~ + (n - l)aix n ~ -\ \- 2a n _ 2 x + o n _i, 

(7) f"( x ) = n (n - l)a x n - 2 + (n - l)(n - 2)aix n " 3 + • • • + 2a n _ 2 , 

etc. Hence we have 

(8) /(x + h) = f{z) + f'(x)h + /"(*) j^ + /"'(^ 1^3 



r! n! 
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where r! is the symbol, read r factorial, for the product 1 • 2 • 3 • • • (r — l)r. Here 
r is a positive integer, but we include the case r = by the definition, 0! = 1. 

This formula (8) is known as Taylor's theorem for the present case of a 
polynomial /(x) of degree n. We call f'(x) the (first) derivative of f(x), and 
f"(x) the second derivative of /(x), etc. Concerning the fact that f"(x) is equal 
to the first derivative of f'(x) and that, in general, the /cth derivative f( k >(x) 
of f(x) is equal to the first derivative of /( fe_1 )(x), see Exs. 6-9 of the next set. 

In view of (8), the limit of (4) as h approaches zero is f'(x). Hence f'(x) is 
the slope of the tangent to the graph of y = f(x) at the point (x,y). 

In (5) and (6), let every a be zero except uq. Thus the derivative of a^x 11 is 
naox n_1 , and hence is obtained by multiplying the given term by its exponent n 
and then diminishing its exponent by unity. For example, the derivative of 2x 3 
is 6x 2 . 

Moreover, the derivative of f(x) is equal to the sum of the derivatives of 
its separate terms. Thus the derivative of x 3 + 4x 2 — 11 is 3x 2 + 8x, as found 
also in §55. 

EXERCISES 

1. Show that the slope of the tangent to y = 8x 3 — 22x 2 + 13x — 2 at (x, y) is 
24x 2 - 44x + 13, and that the bend points are (0.37, 0.203), (1.46, -5.03), approxi- 
mately. Draw the graph. 

2. Prove that the bend points of y = x 3 -2x-5 are (.82,-6.09), (-.82, -3.91), 
approximately. Draw the graph and locate the real roots. 

3. Find the bend points of y = x 3 + 6x 2 + 8x + 8. Locate the real roots. 

4. Locate the real roots of f(x) = x 4 + x 3 — x — 2 = 0. 

Hints: The abscissas of the bend points are the roots of f'(x) = 4x 3 + 3x 2 — 1 = 0. 
The bend points of y = f(x) are (0,-1) and (— ^,— |), so that f'(x) = has a 

(it is just less than ^)- The single b 
approximately. 

5. Locate the real roots of x 6 — 7x 4 — 3x 2 + 7 = 0. 

6. Prove that f"(x), given by (7), is equal to the first derivative of f'(x). 

7. If f(x) = f\(x) + /2(x), prove that the &th derivative of / is equal to the 
sum of the A;th derivatives of f\ and /2. Use (8). 

8. Prove that f^\x) is equal to the first derivative of /( >(x). Hint: prove 
this for / = ax m ; then prove that it is true for / = /i + /2 if true for f\ and /2- 



single real root (it is just less than ^)- The single bend point of y = f(x) is (5, — f|), 
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9. Find the third derivative of x 6 + 5x 4 by forming successive first derivatives; 
also that of 2x 5 — 7x 3 + x. 

10. Prove that if g and k are polynomials in x, the derivative of gk is g'k + gk! . 
Hint: multiply the members of g{x + h) = g{x) + g'{x)h + • • • and k(x + h) = 
k{x) + k'{x)h + • • • and use (8) for / = gk. 



57. Horizontal Tangents. If (x, y) is a bend point of the graph of y = 
f(x), then, by definition, the slope of the tangent at (x,y) is zero. Hence (§56), 
the abscissa x is a root of f'(x) = 0. In Exs. 1-5 of the preceding set, it 
was true that, conversely, any real root of f'(x) = is the abscissa of a bend 
point. However, this is not always the case. We shall now consider in detail 
an example illustrating this fact. The example is the one merely mentioned 
in §55 to indicate the need of the second requirement made in our definition 
of a bend point. 

The graph (Fig. 15) of y = x 3 has no bend point since x 3 increases when 
x increases. Nevertheless, the derivative 3x 2 of x 3 is zero for the real value 
x = 0. The tangent to the curve at (0, 0) is the horizontal line y = 0. It may be 
thought of as the limiting position of a secant through O which meets the curve 
in two further points, seen to be equidistant from O. When one, and hence 
also the other, of the latter points approaches O, the secant approaches the 
position of tangency. In this sense the tangent at O is said to meet the curve 
in three coincident points, their abscissas being the three coinciding roots of 
x 3 = 0. In the language of §17, x 3 = has the triple root x = 0. The subject of 
bend points, to which we recur in §59, has thus led us to a digression on the 
important subject of multiple roots. 

58. Multiple Roots. In (8) replace x by a, and h by x — a. Then 

(9) /(*) = f{a) + f'(a)(x - a) + f'\ a )^f + /"'( a )^£ + . . . 

+ f^^ha) ^' a)m ~ 1 + /( m W g ~" )m + ' ' ' • 
(to — 1)! m,! 

By definition (§17) a is a root of /(x) = of multiplicity to if f(x) is exactly 
divisible by (x — a) m , but not by (x — a) m+1 . Hence a is a root of multiplicity to 
of f(x) = if and only if 

(10) f(a) = 0, f'(a)=0, f"(a) = 0,..., /("^(a) = 0, /M(a) + 0. 

For example, x 4 + 2x 3 = has the triple root x = since is a root, and since 
the first and second derivatives 4x 3 + 6x 2 and 12x 2 + 12x are zero for x = 0, while 
the third derivative 24x + 12 is not zero for x = 0. 
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If in (9) we replace / by /' and hence /W by /*- +1 ), or if we differentiate 
every term with respect to x, we see by either method that 

(11) f'{x) = f'(a) + f"(a)(x -«) + ••• + / (m ~ 1} («) ( m _ 2) , 

f(m),s (x-<x) m + ... 
^ J l J (m-1)! 

Let /(x) and /'(x) have the common factor (x — a) m_1 , but not the common 
factor (x — a) m , where m > 1. Since (11) has the factor (x — a) m_1 , we have 
f'{a) = 0, . . . , /( m-1 )(a) = 0. Since also /(x) has the factor x — a, evidently 
/(a) = 0. Then, by (9), f(x) has the factor (x — a) m , which, by hypothesis, is 
not also a factor of f'(x). Hence, in (11), f^ m \a) / 0. Thus, by (10), a is a 
root of f(x) = of multiplicity m. 

Conversely, let a be a root of f(x) = of multiplicity m. Then relations (10) 
hold, and hence, by (11), /'(x) is divisible by (x — a) 171-1 , but not by (x — a) m . 
Thus f(x) and f'(x) have the common factor (x — a) 771-1 , but not the common 
factor (x — a) m . 

We have now proved the following useful result. 

THEOREM. If f(x) and f'(x) have a greatest common divisor g(x) in- 
volving x, a root of g(x) = of multiplicity m — 1 is a root of f(x) = of 
multiplicity m, and conversely any root of f(x) = of multiplicity m is a root 
of g(x) = of multiplicity m — 1. 

In view of this theorem, the problem of finding all the multiple roots of 
f(x) = and the multiplicity of each multiple root is reduced to the problem 
of finding the roots of g(x) = and the multiplicity of each. 

For example, let f(x) = x 3 — 2x 2 — 4x + 8. Then 

f'{x) = 3x 2 - 4x - 4, 9/(x) = f'(x)(3x - 2) - 32(x - 2). 

Since x — 2 is a factor of f'(x), it may be taken to be the greatest common divisor 
of f(x) and /'(x), the choice of the constant factor c in c{x — 2) being here immaterial. 
Hence 2 is a double root of f{x) = 0, while the remaining root —2 is a simple root. 

EXERCISES 

1. Prove that x 3 — 7x 2 + 15x — 9 = has a double root. 

2. Show that x 4 — 8x 2 + 16 = has two double roots. 

3. Prove that x 4 — 6x 2 — 8x — 3 = has a triple root. 

4. Test x 4 - 8x 3 + 22x 2 - 24x + 9 = for multiple roots. 
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5. Test x 3 — 6x 2 + 11a; — 6 = for multiple roots. 

6. Test x 4 — 9x 3 + 9x 2 + 81a; — 162 = for multiple roots. 



59. Ordinary and Inflexion Tangents. The equation of the straight 
line through the point (a, (3) with the slope s is y — (3 = s(x — a). The slope 
of the tangent to the graph of y = f(x) at the point (a, (3) on it is s = f'(a) 
by §56. Also, (3 = /(a). Hence the equation of the tangent is 

(12) y = f(a) + f'(a)(x-a). 

By subtracting the members of this equation from the corresponding mem- 
bers of equation (9), we see that the abscissas x of the points of intersection 
of the graph of y = f(x) with its tangent satisfy the equation 

f"(a) {x - a)2 + fig) {X ~ a)3 + ■■■ + /("-l)(q)( g - a > m ~ 1 

+ / (m) (a) (^Z^T + ... =a 
m! 

Here the term containing /( m-1 )(a) must evidently be suppressed if m = 2, 
since the term containing f^ rn \a) then coincides with the first term. 

If a is a root of multiplicity in of this equation, i.e., if the left member is 
divisible by (x — a) m , but not by (x — a) m+1 , the point (a, (3) is counted as 
m coincident points of intersection of the curve with its tangent (just as in the 
case of y = x 3 and its tangent y = in §57). This will be the case if and only 
if 

(13) f"(a)=0, f"'(a) = 0,..., /("^(a) = 0, /( m )(a) + 0, 

in which m > 1 and, as explained above, only the final relation f"(a) ^ is 
retained if m = 2. If m = 3, the conditions are f"(a) = 0, /*- 3 )(a) / 0. 

For example, if f(x) = x 4 and a = 0, then /"(0) = /'"(0) = 0, / (4) (0) = 24/0, 
so that m = 4. The graph of y = x 4 is a U-shaped curve, whose intersection with the 
tangent (the x-axis) at (0, 0) is counted as four coincident points of intersection. 

Given f(x) and a, we can find, as in the preceding example, the value of m 
for which relations (13) hold. We then apply the 

THEOREM. If m is even (m > 0), the points of the curve in the vicinity 
of the point of tangency (a, (3) are all on the same side of the tangent, which 
is then called an ordinary tangent. But if m is odd (m > l), the curve 
crosses the tangent at the point of tangency (a, (3), and this point is called an 
inflexion point, while the tangent is called an inflexion tangent. 
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For example, in Fig. 15, OX is an inflexion tangent, while the tangent at any 
point except O is an ordinary tangent. In Figs. 18, 19, 20, the tangents at the 
points marked by crosses are ordinary tangents, but the tangent at the point midway 
between them and on the y-axis is an inflexion tangent. 

To simplify the proof, we first take as new axes lines parallel to the old axes 
and intersecting at (a, (3). In other words, we set x — a = X, y — (3 = Y, where 
X, Y are the coordinates of (x, y) referred to the new axes. Since (3 = /(a), the 

tangent (12) becomes Y = f'(a)X, while, by (9), y = f(x) = f3+f'(a)(x—a)-\ 

becomes 



Y = f(a)X + f(a)^ + 



f'(a)X + f^\a 



X r 



nil 



+ ■■■ 



after omitting terms which are zero by (13). 

To simplify further the algebraic work, 
we pass to oblique axes, 1 the new y-axis 
coinciding with the F-axis, while the new 
x-axis is the tangent, the angle between 
which and the X-axis is designated by 9. 
Then 

tan 6 = f'(a). 

By Fig. 17, 



X 



a; cos l 



Y-y = f{a)X. 




Hence when expressed in terms of the 
new coordinates x, y, the tangent is y = 0, 
while the equation (14) of the curve becomes 



X X-Axis 

Fig. 17 



y 



ex 



m + dx m+1 + 



/ (m) ( 



a cos 



ml 



^0. 



For x sufficiently small numerically, whether positive or negative, the sum 
of the terms after cx m is insignificant in comparison with cx m , so that y has 
the same sign as cx m (§64). Hence, if m is even, the points of the curve in 
the vicinity of the origin and on both sides of it are all on the same side of 
the x-axis, i.e., the tangent. But, if m is odd, the points with small positive 
abscissas x lie on one side of the x-axis and those with numerically small 
negative abscissas lie on the opposite side. 

Our transformations of coordinates changed the equations of the curve and 
of its tangent, but did not change the curve itself and its tangent. Hence our 
theorem is proved. 



"^ince the earlier x, y do not occur in (14) and the new equation of the tangent, we shall 
designate the final coordinates by x, y without confusion. 
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By our theorem, a is the abscissa of an inflexion point of the graph of 
y = f(x) if and only if conditions (13) hold with m odd (m, > l). These 
conditions include neither f(a) = nor f'(a) = 0, in contrast with (10). In the 
theory of equations we are primarily interested in the abscissas a of only those 
points of inflexion whose inflexion tangents are horizontal, and are interested 
in them, because we must exclude such roots a of f'(x) = when seeking the 
abscissas of bend points, which are the important points for our purposes. A 
point on the graph at which the tangent is both horizontal and an ordinary 
tangent is a bend point by the definition in §55. Hence if we apply our theorem 
to the special case f'(a) = 0, we obtain the following 

CRITERION. Any root a of f'(x) = is the abscissa of a bend point of 
the graph ofy— f(x) or of a point with a horizontal inflexion tangent according 
as the value of m for which relations (13) hold is even or odd. 

For example, if f(x) = x 4 , then a = and m = 4, so that (0, 0) is a bend point 
of the U-shaped graph of y = x 4 . If f(x) = x 3 , then a = and m = 3, so that 
(0, 0) is a point with a horizontal inflexion tangent (OX in Fig. 15) of the graph of 
y = x 3 . 



EXERCISES 

1. If f(x) = 3x 5 + 5a: 3 + 4, the only real root of f'(x) = is x = 0. Show 
that (0, 4) is an inflexion point, and thus that there is no bend point and hence that 
f(x) = has a single real root. 

2. Prove that x 3 — 3x 2 + 3a; + c = has an inflexion point, but no bend point. 

3. Show that x 5 — 10a; 3 — 20x 2 — 15a; + c = has two bend points and no 
horizontal inflexion tangents. 

4. Prove that 3x 5 — 40x 3 + 240x+c = has no bend point, but has two horizontal 
inflexion tangents. 

5. Prove that any function x 3 — 3ax 2 + • • • of the third degree can be written in 
the form f(x) = (x — a) 3 + ax + b. The straight line having the equation y = ax + b 
meets the graph of y = f(x) in three coincident points with the abscissa a and hence 
is an inflexion tangent. If we take new axes of coordinates parallel to the old and 
intersecting at the new origin (a, 0), i.e., if we make the transformation x = X + a. 
y = Y, of coordinates, we see that the equation f(x) = becomes a reduced cubic 
equation X 3 + pX + q = (§42). 

6. Find the inflexion tangent to y = x 3 + 6x 2 — 3x + 1 and transform a; 3 + 6x 2 — 
3x + 1 = into a reduced cubic equation. 
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60. Real Roots of a Real Cubic Equation. It suffices to consider 



/(*) 



3lx + q 



(^0), 



in view of Ex. 5 above. Then /' = 
point and the cubic equation f(x 
two bend points 

(V~l,q-2lVi), (-Vl,q + 2lVi) 



3(x 2 - I), /" = 6x. If I < 0, there is no bend 
= has a single real root. If I > 0, there are 



which are shown by crosses in Figs. 18-20 for the graph of y — f(x) in the three 
possible cases specified by the inequalities shown below the figures. For a large 
positive x, the term x 3 in f(x) predominates, so that the graph contains a point 
high up in the first quadrant, thence extends downward to the right-hand bend 
point, then ascends to the left-hand bend point, and finally descends. As a 
check, the graph contains a point far down in the third quadrant, since for x 
negative, but sufficiently large numerically, the term x 3 predominates and the 
sign of y is negative. 





If the equality sign holds in Fig. 18 
or Fig. 19, a necessary and sufficient 
condition for which is q 2 = Al i 1 one of 
the bend points is on the x-axis, and the 
cubic equation has a double root. The 
inequalities in Fig. 20 hold if and only 
if q 2 < 4l 3 , which implies that I > 0. 
Hence x 3 — 3lx + q = has three distinct 




-2l\fl < q < 2lVl 
Fig. 20 



real roots if and only if q 2 < 4/ 3 , a single 

real root if and only if q 2 > 4/ 3 , a double root (necessarily real) if and only if 

q 2 = 4/ 3 and I / 0, and a triple root if q 2 = 4£ 3 = 0. 



EXERCISES 

Find the bend points, sketch the graph, and find the number of real roots of 
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1. x 3 + 2x - 4 = 0. 3. x 3 - 2x - 1 = 0. 

2. x 3 - 7x + 7 = 0. 4. x 3 + 6x 2 - 3x + 1 = 0. 

5. Prove that the inflexion point of y = x 3 — 3lx + q is (0, g). 

6. Show that the theorem in the text is equivalent to that in §45. 

7. Prove that, if m and n are positive odd integers and m > n, x m +px n + q = 
has no bend point and hence has a single real root if p > 0; but, if p < 0, it has just 
two bend points which are on the same side or opposite sides of the x-axis according 
as 

np\ m ( nq 



m / \m — n 

is positive or negative, so that the number of real roots is 1 or 3 in the respective 
cases. 

8. Draw the graph of y = x 4 — x 2 . By finding its intersections with the line 
y = rax + b, solve x 4 — x 2 — mx — 6 = 0. 

9. Prove that, if p and q are positive, x 2m — px 2n + q = has four distinct real 
roots, two pairs of equal roots, or no real root, according as 

m m - ( ^__r~ n > 0, =0, or <0. 

V m / \m — n J 

10. Prove that no straight line crosses the graph of y = /(x) in more than n points 
if the degree n of the real polynomial f(x) exceeds unity. [Apply §16.] This fact 
serves as a check on the accuracy of a graph. 



61. Definition of Continuity of a Polynomial. Hitherto we have 
located certain points of the graph of y = /(x), where /(x) is a polynomial 
in x with real coefficients, and taken the liberty to join them by a continuous 
curve. 

A polynomial /(x) with real coefficients shall be called continuous at x = a, 
where a is a real constant, if the difference 

D = f(a + h)-f(a) 

is numerically less than any assigned positive number p for all real values of h 
sufficiently small numerically. 
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62. Any Polynomial f(x) with real Coefficients is continuous at 
x = a, where a is any real Constant. Taylor's formula (8) gives 

D = /<(»)» + ^W- + ^U» 



1-2 



This polynomial is a special case of 



l-2---n 



F = a-Ji + a 2 h 2 -\ \- a n h n . 

We shall prove that, if a\,. . . ,a n are all real, F is numerically less than any 
assigned positive number p for all real values ofh sufficiently small numerically. 
Denote by g the greatest numerical value of a\, . . . , a n . If h is numerically less 
than k, where k < 1, we see that F is numerically less than 



g(k + k 2 + • • • + k n ) < g 



k 



1-jfc 



<p, 



iik< 



p + g 



Hence a real polynomial f(x) is continuous at every real value of x. But the 
function tanx is not continuous at x = 90° (§63). 



63. Root between a and b if f(a) and f(b) have opposite Signs. If 

the coefficients of a polynomial f(x) are real and if a and b are real numbers 
such that f(a) and f(b) have opposite signs, the equation f(x) = has at least 
one real root between a and b; in fact, an odd number of such roots, if an m-fold 
root is counted as m roots. 

The only argument 2 given here (other than that in 
Ex. 5 below) is one based upon geometrical intuition. We 
are stating that, if the points 



J(a)), 



{b,f(b)) 



lie on opposite sides of the x-axis, the graph of y = f(x) 
crosses the x-axis once, or an odd number of times, be- 
tween the vertical lines through these two points. Indeed, 
the part of the graph between these verticals is a contin- 
uous curve having one and only one point on each inter- 
mediate vertical line, since the function has a single value 
for each value of x. 

This would not follow for the graph of y 2 = x, which 
is a parabola with the x-axis as its axis. It may not cross 




O 




Fig. 21 



2 An arithmetical proof based upon a refined theory of irrational numbers is given in 
Weber's Lehrbuch der Algebra, ed. 2, vol. 1, p. 123. 
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the x-axis between the two initial vertical lines, but cross at a point to the left 
of each. 

A like theorem does not hold for f(x) = tanx, when x is measured in radians 
and < a < tt/2 < b < tt, since tanx is not continuous at x = ir/2. When t 
increases from a to tt/2, tanx increases without limit. When x decreases from 
b to 7r/2, tanx decreases without limit. There is no root between a and b of 
tanx = 0. 



EXERCISES 

1. Prove that 8x 3 — 4x 2 — 18x + 9 = has a root between and 1, one between 
1 and 2, and one between —2 and — 1. 

2. Prove that 16x 4 — 24x 2 + 16x — 3 = has a triple root between and 1, and 
a simple root between —2 and — 1. 

3. Prove that if a < b < c ■ ■ ■ < I, and a, /?,..., A are positive, these quantities 
being all real, 

a 3 7 A 



x — a x — b x — c x — I 

has a real root between a and b, one between b and c, . . . one between k and I, and 
if t is negative one greater than I, but if t is positive one less than a. 

4. Verify that the equation in Ex. 3 has no imaginary root by substituting r + si 
and r — si in turn for x, and subtracting the results. 

5. Admitting that an equation /(x) = x n + • • • = with real coefficients has 
n roots, show algebraically that there is a real root between a and b if /(a) and f(b) 
have opposite signs. Note that a pair of conjugate imaginary roots c ± di are the 
roots of 

(x - c) 2 + d 2 = 

and that this quadratic function is positive if x is real. Hence if x±, . . . ,x r are the 
real roots and 

4>{x) = (x — Xl) • • • (x — x r ), 

then <p{a) and <p{b) have opposite signs. Thus a — Xj and b — Xj have opposite signs 
for at least one real root Xj. (Lagrange.) 
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64. Sign of a Polynomial. Given a polynomial 

f(x) = a x n + aix n ~ x -\ Va n (a + 0) 

with real coefficients, we can find a positive number P such that f(x) has the 
same sign as aQX n when x > P. In fact, 

/(x) = x> + ^ <^=^ + ^ + --- + ^. 

X x z x n 

By the result in §62, the numerical value of 4> is less than that of uq when 
1/x is positive and less than a sufficiently small positive number, say 1/P, and 
hence when x > P. Then ao + 4> has the same sign as uq, and hence f(x) the 
same sign as a^x 11 . 

The last result holds also when x is a negative number sufficiently large 
numerically. For, if we set x = —X, the former case shows that f(—X) has the 
same sign as (— l) n aoX n when X is a sufficiently large positive number. 

We shall therefore say briefly that, for x = +oo, f(x) has the same sign 
as uq; while, for x = — oo, f(x) has the same sign as uq if n is even, but the 
sign opposite to ag if n i s odd. 

EXERCISES 

1. Prove that x 3 + ax 2 + bx — 4 = has a positive real root [use x = and 

x = +oo]. 

2. Prove that x 3 + ax 2 + 6x + 4 = has a negative real root [use x = and 
x = — oo]. 

3. Prove that if ao > and n is odd, aox n + • • • + a n = has a real root of sign 
opposite to the sign of a n [use x = — oo, 0, +oo]. 

4. Prove that x 4 + ax 3 + 6x 2 + ex — 4 = has a positive and a negative root. 

5. Show that any equation of even degree n in which the coefficient of x n and 
the constant term are of opposite signs has a positive and a negative root. 

65. Rolle's Theorem. Between two consecutive real roots a and b of 
f(x) = 0, there is an odd number of real roots of f'(x) = 0, a root of multiplic- 
ity m being counted as m roots. 

Let 

/(x) = (x - a) r {x - b) s Q(x), a<b, 
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where Q(x) is a polynomial divisible by neither x — a nor x — b. Then by the 
rule for the derivative of a product (§56, Ex. 10), 

(x — a)(x — b)f'(x) , ,. , . , w ,sQ'(x) 

- r(x-b) + s(x-a) + (x-a)(x-b) 



f(x) v ' y ' v /v ' Q(x) 

The second member has the value r(a—b) < for x = a and the value s(b—a) > 
for x = b, and hence vanishes an odd number of times between a and b (§63). 
But, in the left member, (x — a)(x — b) and f(x) remain of constant sign between 
a and b, since f(x) = has no root between a and b. Hence f'(x) vanishes an 
odd number of times. 

Corollary. Between two consecutive real roots a and (3 of f'(x) = 
there occurs at most one real root of f(x) = 0. 

For, if there were two such real roots a and b of f(x) = 0, the theorem shows 
that f'(x) = would have a real root between a and b and hence between a 
and /?, contrary to hypothesis. 

Applying also §63 we obtain the 

CRITERION. // a and (3 are consecutive real roots of f'(x) = 0, then 
f(x) = has a single real root between a and (3 if /(a) and f(f3) have opposite 
signs, but no root if they have like signs. At most one real root of f(x) = 
is greater than the greatest real root of f'(x) = 0, and at most one real root of 
f(x) = is less than the least real root of f'(x) = 0. 

If f(a) = for our root a of f'(x) = 0, a is a multiple root of f(x) = and 
it would be removed before the criterion is applied. 

Example. For f(x) = 3x 5 - 25x 3 + 60x - 20, 

±f'(x) =x 4 -5x 2 + A = (x 2 - l)(x 2 - 4). 

Hence the roots of f'{x) = are ±1, ±2. Now 

/(_oo) = _oo, /(_2) = -36, /(-l) = -58, /(l) = 18, /(2) = -4, /(+oo) = +oo. 
Hence there is a single real root in each of the intervals 

(-1,1), (1,2), (2, +oo), 
and two imaginary roots. The three real roots are positive. 
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EXERCISES 

1. Prove that a; 5 — 5a: + 2 = has 1 negative, 2 positive and 2 imaginary roots. 

2. Prove that x 6 + x — 1 = has 1 negative, 1 positive and 4 imaginary roots. 

3. Show that a: 5 — 3a; 3 + 2a; 2 — 5 = has two imaginary roots, and a real root 
in each of the intervals (—2,-1.5), (—1.5,-1), (1,2). 

4. Prove that 4x 5 — 3x 4 — 2x 2 + 4x — 10 = has a single real root. 

5. Show that, if f^ k '(x) = has imaginary roots, f(x) = has imaginary roots. 

6. Derive Rolle's theorem from the fact that there is an odd number of bend 
points between a and b, the abscissa of each being a root of f'(x) = of odd 
multiplicity, while the abscissa of an inflexion point with a horizontal tangent is a 
root of f'(x) = of even multiplicity. 



CHAPTER VI 

Isolation of the Real Roots of a Real Equation 

66. Purpose and Methods of Isolating the Real Roots. In the 

next chapter we shall explain processes of computing the real roots of a given 
real equation to any assigned number of decimal places. Each such method 
requires some preliminary information concerning the root to be computed. 
For example, it would be sufficient to know that the root is between 4 and 5, 
provided there be no other root between the same limits. But in the contrary 
case, narrower limits are necessary, such as 4 and 4.3, with the further fact 
that only one root is between these new limits. Then that root is said to be 
isolated. 

If an equation has a single positive root and a single negative root, the real roots 
are isolated, since there is a single root between — oo and 0, and a single one between 
and +oo. However, for the practical purpose of their computation, we shall need 
narrower limits, sufficient to fix the first significant figure of each root, for example 
-40 and -30, or 20 and 30. 

We may isolate the real roots of f(x) = by means of the graph of y = f(x). 
But to obtain a reliable graph, we saw in Chapter V that we must employ the 
bend points, whose abscissas occur among the roots of f'(x) = 0. Since the 
latter equation is of degree n — 1 when f(x) = is of degree n, this method 
is usually impracticable when n exceeds 3. The method based on Rolle's 
theorem (§65) is open to the same objection. 

The most effective method is that due to Sturm (§68). We shall, however, 
begin with Descartes' rule of signs since it is so easily applied. Unfortunately 
it rarely tells us the exact number of real roots. 

67. Descartes' Rule of Signs. Two consecutive terms of a real poly- 
nomial or equation are said to present a variation of sign if their coefficients 
have unlike signs. By the variations of sign of a real polynomial or equation 
we mean all the variations presented by consecutive terms. 

Thus, in x 5 — 2x 3 — 4x 2 + 3 = 0, the first two terms present a variation of sign, 
and likewise the last two terms. The number of variations of sign of the equation is 
two. 
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DESCARTES' RULE The number of positive real roots of an equation with 
real coefficients is either equal to the number of its variations of sign or is less 
than that number by a positive even integer. A root of multiplicity m is here 
counted as m roots. 

For example, x 6 — 3x 2 + x + 1 = has either two or no positive roots, the exact 
number not being found. But 3x 3 — x — 1 = has exactly one positive root, which 
is a simple root. 

Descartes' rule will be derived in §73 as a corollary to Budan's theorem. 
The following elementary proof 1 was communicated to the author by Professor 
D. R. Curtiss. 

Consider any real polynomial 

f(x) = a x n + a x x n ~ X + ■■■ + aix n ~ l (a + 0, a x / 0). 

Let r be a positive real number. By actual multiplication, 

F(x) = {x- r)f(x) = A x n+1 + A x x n + ■■■ + A l+1 x n ~ l , 

where 

Aq = a , Ai=ai-rao, A 2 = a 2 - ra\,. . . ,A t = a t - ra/_i, A l+1 = -ra h 

In f(x) let a^ be the first non-vanishing coefficient of different sign from ao, 
let a^ 2 be the first non- vanishing coefficient following a^ and of the same sign 
as ao, etc., the last such term, a^, being either a/ or of the same sign as a\. 
Evidently v is the number of variations of sign of f(x). 

For example, if f(x) = 2x 6 + 3x 5 — 4x 4 — 6x 3 + 7x, we have v — 2, a^ = 02 = —4, 
ak 2 = a 5 = 7. Note that 04 = since x 2 is absent. 

The numbers AqjA^, . . . ,A^ v , A\ + i are all different from zero and have 
the same signs as 00,0^, . . . ,a^ v , —a/, respectively. This is obviously true for 
Ao = uq and Ai + i = — ro/. Next, A^. is the sum of the non-vanishing number 
aft. and the number — rafc._i, which is either zero or else of the same sign as 
au since au _i is either zero or of opposite sign to au.. Hence the sum Au. is 
not zero and has the same sign as a^.. 

By hypothesis, each of the numbers ao, a^, . . . , a^ v after the first is of op- 
posite sign to its predecessor, while —a/ is of opposite sign to a^ v . Hence each 
term after the first in the sequence Aq, A^ , . . . , A^ v , Ai + \ is of opposite sign to 
its predecessor. Thus these terms present v + 1 variations of sign. We conclude 
that F(x) has at least one more variation of sign than /(#). But we may go 
further and prove the following 



1 The proofs given in college algebras are mere verifications of special cases. 
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LEMMA. The number of variations of sign of F(x) is equal to that of f(x) 
increased by some positive odd integer. 

For, the sequence Aq,Ai, . . . ,^4^ 1 has an odd number of variations of sign 
since its first and last terms are of opposite sign; and similarly for the v se- 
quences 

-Afcii ^-fci+l; ■ • • > ^-ki'i 



A k v i A kv+1 , . . . , A i+1 . 

The total number of variations of sign of the entire sequence Aq, A\, . . . , Ai + i 
is evidently the sum of the numbers of variations of sign for the v + 1 partial 
sequences indicated above, and is thus the sum of v + 1 positive odd integers. 
Since each such odd integer may be expressed as 1 plus or a positive even 
integer, the sum mentioned is equal to v + 1 plus or a positive even integer, 
i.e., to v plus a positive odd integer. 

To prove Descartes' rule of signs, consider first the case in which f(x) = 
has no positive real roots, i.e., no real root between and +oo. Then /(0) and 
/(oo) are of the same sign (§63), and hence the first and last coefficients of 
f(x) are of the same sign. 2 Thus f(x) has either no variations of sign or an 
even number of them, as Descartes' rule requires. 

Next, let f(x) = have the positive real roots r\, . . . ,r& and no others. A 
root of multiplicity m occurs here m times, so that the r's need not be distinct. 
Then 

f(x) = (x - ri) • • • (x - r k )4>{x), 

where </>(x) is a polynomial with real coefficients such that <f>(x) = has no 
positive real roots. We saw in the preceding paragraph that <f>(x) has either 
no variations of sign or an even number of them. By the Lemma, the product 
(x — ffc)0(x) has as the number of its variations of sign the number for <j>(x) 
increased by a positive odd integer. Similarly when we introduce each new 
factor x — ri. Hence the number of variations of sign of the final product f(x) 
is equal to that of <fi(x) increased by k positive odd integers, i.e., by k plus 
or a positive even integer. Since <f>(x) has either no variations of sign or an 
even number of them, the number of variations of sign of f(x) is k plus or a 
positive even integer, a result equivalent to our statement of Descartes' rule. 
If — p is a negative root of f(x) = 0, then p is a positive root of f(—x) = 0. 
Hence we obtain the 

COROLLARY. The number of negative roots of f(x) = is either equal 
to the number of variations of sign of f(—x) or is less than that number by a 
positive even integer. 



2 In case f(x) has a factor x n , we use the polynomial f(x)/x n l instead of fix) in this 
argument. 
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For example, x 4 + 3x 3 + x — 1 = has a single negative root, which is a simple 
root, since x 4 — 3x 3 — x — 1 = has a single positive root. 

As indicated in Exs. 10, 11 below, Descartes' rule may be used to isolate 
the roots. 

EXERCISES 

Prove by Descartes' rule the statements in Exs. 1-8, 12, 15. 

1. An equation all of whose coefficients are of like sign has no positive root. 
Why is this self-evident? 

2. There is no negative root of an equation, like x 5 — 2x 4 — 3x 2 + 7x — 5 = 0, 
in which the coefficients of the odd powers of x are of like sign, and the coefficients 
of the even powers (including the constant term) are of the opposite sign. Verify by 
taking x = —p, where p is positive. 

3. x 3 + a 2 x + b 2 = has two imaginary roots if b / 0. 

4. For n even, x n — 1 = has only two real roots. 

5. For n odd, x n — 1 = has only one real root. 

6. For n even, x n + 1 = has no real root; for n odd, only one. 

7. x 4 + 12x 2 + 5x — 9 = has just two imaginary roots. 

8. x 4 + a 2 x 2 + b 2 x — c 2 = (c / 0) has just two imaginary roots. 

9. Descartes' rule enables us to hnd the exact number of positive roots only 
when all the coefficients are of like sign or when 

f( x ) = X n + piX n ~ l + ■■■+ p n - s X S - Pns+iX 3 ' 1 p n = 0, 

each pi being ^ 0. Without using that rule, show that the latter equation has one 
and only one positive root r. Hints: There is a positive root r by §63 (a = 0, b = oo). 
Denote by P{x) the quotient of the sum of the positive terms by x s , and by —N(x) 
that of the negative terms. Then N(x) is a sum of powers of 1/x with positive 
coefficients. 

If x > r, P(x) > P(r), N(x) < N(r), f(x) > 0; 
If x < r, P(x) < P(r), N(x) > N(r), f(x) < 0. 

10. Prove that we obtain an upper limit to the number of real roots of f{x) = 
between a and b, if we set 

a + by 
x = — — •"• y 



1 + y \ b — x 

multiply by (1 + y) n , and apply Descartes' rule to the resulting equation in y. 
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11. Show by the method of Ex. 10 that there is a single root between 2 and 4 of 
x 3 + x 2 - Ylx + 15 = 0. Here we have 27y 3 + 3y 2 - 23y -7 = 0. 

12. In the astronomical problem of three bodies occurs the equation 

r 5 + (3 - /x)r 4 + (3 - 2/x)r 3 - fir 2 - 2/j,r - /j, = 0, 
where < /x < 1. Why is there a single positive real root? 

13. Prove that x 5 + x 3 — x 2 + 2x — 3 = has four imaginary roots by applying 
Descartes' rule to the equation in y whose roots are the squares of the roots of the 
former. Transpose the odd powers, square each new member, and replace x 2 by y. 

14. As in Ex. 13 prove that x 3 + x 2 + 8a; + 6 = has imaginary roots. 

15. If a real equation /(x) = of degree n has n real roots, the number of 
positive roots is exactly equal to the number V of variations of sign. Hint: consider 
also f{—x). 

16. Show that x 3 — x 2 + 2x + 1 = has no positive root. Hint: multiply by x + 1. 



68. Sturm's Method. Let f(x) = be an equation with real coefficients, 
and f'(x) the first derivative of f(x). The first step of the usual process of 
finding the greatest common divisor of f(x) and /'(x), if it exists, consists in 
dividing / by /' until we obtain a remainder r(x), whose degree is less than that 
of /'. Then, if q\ is the quotient, we have / = q\f' + r. Instead of dividing /' 
by r, as in the greatest common divisor process, and proceeding further in that 
manner, we write fa = — r, divide /' by fa, and denote by fa the remainder 
with its sign changed. Thus 

/ = Qlf' ~ H-, f = 12 fa- fa, fa = ?3/3-/4, ••• • 

The latter equations, in which each remainder is exhibited as the negative 
of a polynomial fa, yield a modified process, just as effective as the usual 
process, of finding the greatest common divisor G of f(x) and f'(x) if it exists. 

Suppose that —fa is the first constant remainder. If fa = 0, then ^3 = G, 
since fa divides fa and hence also /' and / (as shown by using our above 
equations in reverse order); while, conversely, any common divisor of / and /' 
divides fa and hence also ^3. 

But if /4 is a constant / 0, / and /' have no common divisor involving x. 
This case arises if and only if f(x) = has no multiple root (§58), and is the 
only case considered in §§69-71. 

Before stating Sturm's theorem in general, we shall state it for a numerical 
case and illustrate its use. 
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Example. f(x) 



x 3 + Ax 2 - 7. Then /' = 3x 2 + 8x, 



/' = (32 X + T02i)/2 - /3, /3 = 



32 
9 

4221 
1024- 



x + 7, 



For 3 x = 1, the signs of /, /', /2, /3, are — + + + , showing a single variation 
of consecutive signs. For x = 2, the signs are + + + + , showing no variation of 
sign. Sturm's theorem states that there is a single real root between 1 and 2. For 
x = — oo, the signs are — + — + , showing 3 variations of sign. The theorem states 
that there are 3—1 = 2 real roots between — oo and 1. Similarly, 



X 


Signs 


Variations 


-1 


+ + 


1 


-2 


+ + 


2 


-3 


+ + - + 


2 


-4 


- + - + 


3 



Hence there is a single real root between —2 and — 1, and a single one between —4 
and —3. Each real root has now been isolated since we have found two numbers such 
that a single real root lies between these two numbers or is equal to one of them. 

Some of the preceding computation was unnecessary. After isolating a root 
between —2 and — 1, we know that the remaining root is isolated between — oo 
and —2. But before we can compute it by Horner's method, we need closer limits 
for it. For that purpose it is unnecessary to find the signs of all four functions, but 
merely the sign of / (§63). 



69. Sturm's Theorem. Let f(x) = be an equation with real coeffi- 
cients and without multiple roots. Modify the usual process of seeking the 
greatest common divisor of f(x) and its first derivative 4 " f\{x) by exhibiting 
each remainder as the negative of a polynomial fa: 

(l) / = Qlfl - /2> /l = 92/2 - hi h = Q3h - /-4, ■ ■ • > fn-2 = Qn-lfn-1 ~ fn, 

where 5 f n is a constant / 0. If a and b are real numbers, a <b, neither a root 
°f f( x ) — 0) the number of real roots of f(x) = between a and b is equal to 
the excess of the number of variations of sign of 



(2) 



f(x), fl(x), h{x),...,f n -\{x), f n 



3 Bcforc going further, check that the preceding relations hold when x = 1 by inserting 
the computed values of /, /', fa for x = 1. Experience shows that most students make some 
error in finding fa, fa, . . . , so that checking is essential. 

4 The notation fa instead of the usual /', and similarly fa instead of /, is used to regularize 
the notation of all the /'s, and enables us to write any one of the equations (1) in the single 
notation (3). 

5 If the division process did not yield ultimately a constant remainder 7^ 0, / and fa 
would have a common factor involving x, and hence f(x) = a multiple root. 
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for x = a over the number of variations of sign for x = b. Terms which vanish 
are to be dropped out before counting the variations of sign. 

For brevity, let V x denote the number of variations of sign of the numbers (2) 
when x is a particular real number not a root of f(x) = 0. 

First, if x\ and X2 are real numbers such that no one of the continuous 
functions (2) vanishes for a value of x between x\ and X2 or for x = x\ or 
x = X2, the values of any one of these functions for x = x\ and x = X2 are both 
positive or both negative (§63), and therefore V Xl = V X2 . 

Second, let p be a root of fi(x) = 0, where 1 ^ i < n. Then 

(3) fi-i(x) = qifi(x) ~ fi+i(x) 

and the equations (1) following this one show that fi_i(x) and fi(x) have 
no common divisor involving x (since it would divide the constant f n ). By 
hypothesis, fi(x) has the factor x — p. Hence fi_i(x) does not have this factor 
x- p. Thus, by (3), 

fi-l( P ) = -fi+l( P )^0. 

Hence, if p is a sufficiently small positive number, the values of 

fi-l(x), fi(x), fi+i(x) 

forx = p — p show just one variation of sign, since the first and third values 
are of opposite sign, and for x = p + p show just one variation of sign, and 
therefore show no change in the number of variations of sign for the two values 
of x. 

It follows from the first and second cases that V a = Vaiia and (3 are real 
numbers for neither of which any one of the functions (2) vanishes and such 
that no root of f(x) = lies between a and (3. 

Third, let r be a root of f(x) = 0. By Taylor's theorem (8) of §56, 

fir -P) = -Pf'(r) + \p 2 f"(r) , 

/(r +p)= pf\r) + \p 2 f"{r) -\ . 

If p is a sufficiently small positive number, each of these polynomials in p has 
the same sign as its first term. For, after removing the factor p, we obtain 
a quotient of the form ao + s, where s = a\p + a2P 2 + • • • is numerically less 
than ao for all values of p sufficiently small (§62). Hence if f'(r) is positive, 
f(r—p) is negative and f(r+p) is positive, so that the terms f(x), f\(x) = f'(x) 
have the signs — + for x = r — p and the signs + + for x = r + p. If f'(r) is 

negative, these signs are -\ and — — respectively. In each case, /(x), f\(x) 

show one more variation of sign for x = r — p than for x = r + p. Evidently p 
may be chosen so small that no one of the functions fi(x), . . . ,f n vanishes for 
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either x = r — p or x = r + p, and such that f\(x) does not vanish for a value 
of x between r — p and r + p, so that f(x) = has the single real root r between 
these limits (§65). Hence by the first and second cases, fi,...,fn show the 
same number of variations of sign for x = r — p as for x = r + p. Thus, for the 
entire series of functions (2), we have 

(4) V r -p - V r+p = 1. 

The real roots of f(x) = within the main interval from a to b (i.e., the 
aggregate of numbers between a and b) separate it into intervals. By the 
earlier result, V x has the same value for all numbers in the same interval. By 
the present result (4), the value V x in any interval exceeds the value for the next 
interval by unity. Hence V a exceeds Vf, by the number of real roots between a 
and b. 

Corollary. If a <b, then v a ^ V b . 

A violation of this Corollary usually indicates an error in the computation 
of Sturm's functions (2). 



EXERCISES 

Isolate by Sturm's theorem the real roots of 

1. x 3 + 2x + 20 = 0. 2. x 3 + x - 3 = 0. 



70. Simplifications of Sturm's Functions. In order to avoid fractions, 
we may first multiply f(x) by a positive constant before dividing it by by /i(x), 
and similarly multiply /i by a positive constant before dividing it by /2, etc. 
Moreover, we may remove from any fi any factor k\ which is either a positive 
constant or a polynomial in x positive for 6 a ^ x ^ b, and use the remaining 
factor Fi as the next divisor. 

To prove that Sturm's theorem remains true when these modified functions 
/, Fi,...,F m are employed in place of functions (2), consider the equations 
replacing (1): 

f 1 = k 1 F 1 , c 2 f = q\Fi - k 2 F 2 , c 3 Fi = q 2 F 2 - k^F^, 

C4F2 = Q3 F 3 ~ k±F A , ..., c rn F rn _ 2 = q m _iF m _i - k m F m , 



6 Usually we would require that ki be positive for all values of x, since we usually wish 
to employ the limits — 00 and +00. 
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in which C2, C3, . . . are positive constants and F m is a constant / 0. A common 
divisor (involving x) of -Fj_i and Fj would divide i^_2, . . . , F2, i<i, /, /1, whereas 
/(a;) = has no multiple roots. Hence if p is a root of F^{x) = 0, then Fj_i(p) / 
and 

Ci+l-P»-l(p) = -fei+l(p)^+l(p), q+l > 0, fci+i(p) > 0. 

Thus Fj_i and Fi + \ have opposite signs for x = p. We proceed as in §69. 

Example 1. If f(x) = x 3 + 6x — 10, /1 = 3(a; 2 + 2) is always positive. Hence 
we may employ / and F\ = 1. For x = —00, there is one variation of sign; for 
x = +00, no variation. Hence there is a single real root; it lies between 1 and 2. 

Example 2. If f(x) = 2x 4 - 13a; 2 - 10a; - 19, we may take 

f x = 4x 3 - 13a; - 5. 

Then 

2/ = x/i - f 2 , h = 13x 2 + 15x + 38 = 13(x + i) 2 + ^. 

Since /2 is always positive, we need go no further (we may take F2 = 1). For 
x = —00, the signs are + — + ; for x = +00, + + + . Hence there are two real 

roots. The signs for x = are \- . Hence one real root is positive and the other 

negative. 

EXERCISES 

Isolate by Sturm's theorem the real roots of 

1. a; 3 + 3a; 2 - 2a; - 5 = 0. 2. x 4 + 12a; 2 + 5ai - 9 = 0. 

3. a; 3 - 7x - 7 = 0. 4. 3ai 4 - 6x 2 + 8x - 3 = 0. 

5. x 6 + 6x 5 - 30x 2 - 12x - 9 = [stop with f 2 ]. 

6. x 4 - 8a; 3 + 25x 2 - 36a; + 8 = 0. 

7. For / = x 3 + px + q (p 7^ 0), show that f± = 3a; 2 + p, f 2 = —2px — 3q, 

V/i = (-6px + 9q)f 2 - f 3 , h = -V - 27g 2 , 

so that /3 is the discriminant A (§44). Let [p] denote the sign of p. Then the signs 
of/, /1, /2, /3 are 

— + + \p] [A] for x = —00, 
+ + — [p] [A] for x = +00. 

For A negative there is a single real root. For A positive and therefore p negative, 
there are three distinct real roots. For A = 0, f 2 is a divisor of f± and /, so that 
x = —3q/(2p) is a double root. 
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8. Prove that if one of Sturm's functions has p imaginary roots, the initial 
equation has at least p imaginary roots. 

9. State Sturm's theorem so as to include the possibility of a, or b, or both a 
and b being roots of f(x) = 0. 



71. Sturm's Functions for a Quartic Equation. For the reduced 
quartic equation f(z) = 0, 



(5) 



A 9 

/ = z + qz + rz + s, 
f x = 4z :i + 2qz + r, 
fl = —2qz — 3rz — 4s. 



Let q ^ and divide q 2 f\ by fi- The negative of the remainder is 

(6) fs = Lz- 12rs - rq 2 , L = 8qs - 2q 3 - 9r 2 . 

Let L / 0. Then f^ is a constant which is zero if and only if / = has multiple 
roots, i.e., if its discriminant A is zero. We therefore desire f^ expressed as a 
multiple of A. By §50, 

„2 

(7) A = -4P 3 -27Q 2 , P = -As-\, Q = fgs - r 2 - ^q S . 



(8) As = -P- q -, r 2 = -Q-lqP-^q Z . 



We may employ P and Q to eliminate 

3 
We divide L 2 f^ by 

(9) h = Lz + 3rP, L = 9Q + 4qP. 
The negative of the remainder 7 is 

(10) 18r 2 gP 2 - 9r 2 LP + 4sL 2 = q 2 A. 

The left member is easily reduced to q 2 A. Inserting the values (8) and replacing 
L 2 by L(9Q + 4qP), we get 

-18gQP 2 - 12g 2 P 3 - ±§q A P 2 + 2qP 2 L + f g 3 PL - 3q 2 QL. 



7 Found directly by the Remainder Theorem (§14) by inserting the root z = —?>rP/L of 
h = into L 2 / 2 - 
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Replacing L by its value (9), we get q 2 A. Hence we may take 
(11) / 4 = A. 

Hence if qLA / 0, we may take (5), (9), (11) as Sturm's functions. 
Denote the sign of q by [q\. The signs of Sturm's functions are 



- 


-[<?] 


— [L] [A] for x = -ex), 


+ 


-[<?] 


[L] [A] for x = +oo. 



First, let A > 0. If q is negative and L is positive, the signs are -\ 1 \- 

and + + + + +, so that there are four real roots. In each of the remaining 
three cases for q and L, there are two variations of sign in either of the two 
series and hence there is no real root. 

Next, let A < 0. In each of the three cases in which q and L are not both 
positive, there are three variations of sign in the first series and one variation 
in the second, and hence just two real roots. If q and L are both positive, the 
number of variations is 1 in the first series and 3 in the second, so that this 
case is excluded by the Corollary to Sturm's theorem. To give a direct proof, 
note that, by the value of L in (6), L > 0, q > imply 4s > q 2 , i.e., s > 0, and 
hence, by (7), P is negative, so that each term of (10) is ^ 0, whence A > 0. 

Hence, if qLA ^ 0, there are four distinct real roots if and only if A and L 
are positive, and q negative; two distinct real and two imaginary roots if and 
only if A is negative. 

Combining this result with that in Ex. 4 below, we obtain the 

THEOREM. If the discriminant A of z A + qz 2 + rz + s = is negative, there 
are two distinct real roots and two imaginary roots; if A > 0, q < 0, L > 0, 
four distinct real roots; if A > and either q^.0 or L ^0, no real roots. Here 
L = 8qs- 2q i -9r 2 . 

Our discussion furnished also the series of Sturm functions, which may be 
used in isolating the roots. 



EXERCISES 

1. If q A ^ 0, L = 0, then fs = 3rP is not zero (there being no multiple root) 
and its sign is immaterial in determining the number of real roots. Prove that there 
are just two real roots if q < 0, and none if q > 0. By (10), q has the same sign as A. 

2. If rA 7^ 0, q — 0, obtain — fy by substituting z = — 4s/(3r) in f\. Show that 
we may take fs = rA and that there are just two real roots if A < 0, and no real 
roots if A > 0. 
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3. If A / 0, q = r = 0, prove that there are just two real roots if A < 0, and 
no real roots if A > 0. Since A = 256s 3 , check by solving z 4 + s = 0. 

4. If A ^ 0, qL = 0, there are just two real roots if A < 0, and no real roots if 
A > 0. [Combine the results in Exs. 1-3.] 

5. Apply the theorem to Exs. 2, 4, 6 of §70. 

6. Isolate the real roots of Exs. 3, 4, 5 of §48. 

72. Sturm's Theorem for the Case of Multiple Roots. We might 
remove the multiple roots by dividing f(x) by 8 f n (x), the greatest common 
divisor of f(x) and f\ = f'(x); but this would involve considerable work, besides 
wasting the valuable information in hand. As before, we suppose f(a) and f(b) 
different from zero. We have equations (1) in which f n is now not a constant. 

The difference V a — V\, is the number of real roots between a and b, each 
multiple root being counted only once. 

If p is a root of fo(x) = 0, but not a multiple root of f(x) = 0, then fi-i(p) 
is not zero. For, if it were zero, x — p would by (1) be a common factor of / 
and f\. We may now proceed as in the second case in §69. 

The third case requires a modified proof only when r is a multiple root. 
Let r be a root of multiplicity to, to ^ 2. Then f(r), /'(r), . . . ,/( m_1 )(r) are 
zero and, by Taylor's theorem, 

777. 

f(r + p ) = — * f {m) (r) + ■■■ , 

rn—1 

f'(r + p) = —^- /(«)( r ) + .... 

1 • 2 • • • (to — 1) 

These have like signs if p is a positive number so small that the signs of the 
polynomials are those of their first terms. Similarly, f(r — p) and f'(r — p) 
have opposite signs. Hence / and f\ show one more variation of sign for 
x = r — p than for x = r + p. Now (x — r) m_1 is a factor of / and f\ and 
hence, by (1), of ji, ■ ■ ■ , fn- Let their quotients by this factor be </>,(f>i,..., <f> n . 
Then equations (1) hold after the f's are replaced by the </>'s. Taking p so 
small that 4>i(x) = has no root between r — p and r + p, we see by the first 
and second cases in §69 that <f>i, . . . , 4> n show the same number of variations 
of sign for x = r — p as for x = r + p. The same is true for f\, . . . , f n since 
the products of fa, . . . , 4>n by ( x — r) m_1 have for a given x the same signs as 
fa,...,4>n or the same signs as —fa, . . . , — <f> n . But the latter series evidently 
shows the same number of variations of sign as fa,... , <f> n . Hence (4) is proved 
and consequently the present theorem. 



3 The degree of f(x) is not n, nor was it necessarily n in §69. 
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EXERCISES 

1. For / = x 4 — 8x 2 + 16, prove that F\ = x 3 — Ax, F2 = x 2 — 4, iq = XF2. 
Hence n = 2. Verify that V_oo = 2, V^ = 0, and that there are just two real roots, 
each a double root. 



Discuss similarly the following equations. 

2. x 4 -5x 3 +9x 2 -7x+ 3. x 4 +2x 3 -3x 2 -4x+ 

2 = 0. 4 = 0. 



4. x 4 -x 2 -2x+2 = 0. 



73. Budan's Theorem. Let a and b be real numbers, a < b, neither 9 a 
root of f(x) = 0, an equation of degree n with real coefficients. Let V a denote 
the number of variations of sign of 



(12) 



f{x), f\x), f"{x), 



/(»)(x) 



for x = a, after vanishing terms have been deleted. Then V a — V^ is either the 
number of real roots of f(x) = between a and b or exceeds the number of those 
roots by a positive even integer. A root of multiplicity m is here counted as 
m roots. 



For example, if f(x) 
values for x = 3, 4, —2, 



= x 3 - 7x - 7, then /' = 3x 2 - 7, /" = Gx, f" = 6. Their 
1 are tabulated below. 



X 


/ 


/' 


/" 


/"' 


Variations 


3 


-1 


20 


18 


6 


1 


4 


29 


41 


24 


6 





-2 


-1 


5 


-12 


6 


3 


-1 


-1 


-4 


-6 


6 


1 



Hence the theorem shows that there is a single real root between 3 and 4, and two or 
no real roots between —2 and — 1. The theorem does not tell us the exact number of 
roots between the latter limits. To decide this ambiguity, note that /(— 3/2) = +1/8, 
so that there is a single real root between —2 and —1.5, and a single one between 
— 1.5 and — 1. 

The proof is quite simple if no term of the series (12) vanishes for x = a 
or for x = b and if no two consecutive terms vanish for the same value of x 
between a and b. Indeed, if no one of the terms vanishes for x\ ^ x ^ X2, then 



9 In case a or b is a root of fix) = 0, the theorem holds if we count the number of roots 
> a and ^ b. This inclusive theorem has been proved, by means of Rolle's theorem, by 
A. Hurwitz, Mathematische Annalen, Vol. 71, 1912, p. 584, who extended Budan's theorem 
from the case of a polynomial to a function fix) which is real and regular for a ^ x < b. 
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V xx = V X2 , since any term has the same sign for x = x\ as for x = xi- Next, 
let r be a root of f' l '(x) = 0, a < r < b. By hypothesis, the first derivative 
f( t+1 \x) of /w(x) is not zero for x = r. As in the third step (now actually 
the case i = 0) in §69, f^'(x) and /( l+1 )(x) show one more variation of sign for 
x = r — p than for x = r + p, where p is a sufficiently small positive number. If 
i > 0, /W is preceded by a term f^ -1 ' in (12). By hypothesis, /^ _1 -*(a;) ^ 
for x = r and hence has the same sign for x = r — p and x = r + p when p 
is sufficiently small. For these values of x, f^'(x) has opposite signs. Hence 
/(* -1 ) and /W show one more or one less variation of sign for x = r — p than 
for x = r + p, so that i"^ -1 -*, f^\ f( t+1 > show two more variations or the same 
number of variations of sign. 

Next, let no term of the series (12) vanish for x = a or for x = b, but let 
several successive terms 

(13) /(*)( X ), f^ +1 \x),...,f^- 1 \x) 

all vanish for a value r of x between a and 6, while f( i+ J'(r) is not zero, but is 
say positive. 10 Let I\ be the interval between r — p and r, and 1^ the interval 
between r and r + p. Let the positive number p be so small that no one of the 
functions (13) or f^ + ^(x) is zero in these intervals, so that the last function 
remains positive. Hence /^ + - ?_1 )(x) increases with x (since its derivative is 
positive) and is therefore negative in I\ and positive in 1^- Thus f( l+ J~ 2 >(x) 
decreases in I\ and increases in 1^ and hence is positive in each interval. In 
this manner we may verify the signs in the following table: 



h 

h 



j(i) j-(i+l) j(i+2) __ f(i+j-S) f(i+j-2) f(i+j-l) f(i+j) 

i-y i-y- 1 i-y- 2 ... + + 

+ ++...+ + + + 



Hence these functions show j variations of sign in I\ and none in I^- 

If i > 0, the first term of (13) is preceded by a function f^ l ~ 1 \x) which is 
not zero for x = r, and hence not zero in I\ or 1^ if p is sufficiently small. If j 
is even, the signs of f^~ 1 ' and f' 1 ' are + + or — h in both I\ and I2, showing 
no loss in the number of variations of sign. If j is odd, their signs are 



+ - 

or 

+ + - + 



h 

h 
so that there is a loss or gain of a single variation of sign. Hence 



If negative, all signs in the table below are to be changed; but the conclusion holds. 
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show a loss of j variations of sign if j is even, and a loss of j ± 1 if j is odd, 
and hence always a loss of an even number ^ of variations of sign. 

If i = 0, /W = / has r as a j-fold root and the functions in the table show 
j more variations of sign for x = r — p than for x = r + p. 

Thus, when no one of the functions (12) vanishes for x = a or for x = b, the 
theorem follows as at the end of §69 (with unity replaced by the multiplicity 
of a root). 

Finally, let one of the functions (12), other than f(x) itself, vanish for x = a 
or for x = b. If 8 is a sufficiently small positive number, all of the N roots of 
f(x) = between a and b lie between a + 8 and b — 8, and for the latter values 
no one of the functions (12) is zero. By the above proof, 

V a+5 - V b _ 5 = N + 2t, 
V a - V a+5 = 2j, V b _ s -V b = 2s, 

where t, j, s are integers ^ 0. Hence V a - V b = N + 2(t + j + s). 

Descartes' rule of signs (§67) is a corollary to Budan's theorem. Consider 
any equation with real coefficients 

f(x) = a,QX n + a\x n ~ x + h a n -\x + a n = 0, 

having a n / 0. For x = the functions (12) have the same signs as 

a n , a n _i, ..., ai, aQ. 

Hence Vo is equal to the number V of variations of sign of f(x). 

For x = +oo, the functions all have the same sign, which is that of aQ. Thus 
Vo — Foo = V is either the number of positive roots or exceeds that number by 
a positive even integer. Finally, Descartes' rule holds if a n = 0, as shown by 
removing the factors x. 

EXERCISES 

Isolate by Budan's theorem the real roots of 

1. x 3 - x 2 - 2x + 1 = 0. 2. x 3 + 3x 2 - 2x - 5 = 0. 

3. Prove that if f(a) / 0, V a equals the number of real roots > a or exceeds 
that number by an even integer. 

4. Prove that there is no root greater than a number making each of the func- 
tions (12) positive, if the leading coefficient of f(x) is positive. (Newton.) 

5. Hence verify that x 4 — 4x 3 — 3x + 23 = has no root > 4. 

6. Show that x 4 — 4x 3 + x 1 + 6a; + 2 = has no root > 3. 



CHAPTER VII 

Solution of Numerical Equations 

74. Horner's Method. 1 After we have isolated a real root of a real 
equation by one of the methods in Chapter VI, we can compute the root to 
any desired number of decimal places either by Horner's method, which is 
available only for polynomial equations, or by Newton's method (§75), which 
is applicable also to logarithmic, trigonometric, and other equations. 

To find the root between 2 and 3 of 

(1) x 3 - 2a;- 5 = 0, 

set x = 2 + p. Direct substitution gives the transformed equation for p: 

(2) p 3 + 6p 2 + 10p - 1 = 0. 

The method just used is laborious especially for equations of high degree. We 
next explain a simpler method. Since p = x — 2, 

x 3 - 2x - 5 = (x - 2) 3 + 6(x - 2) 2 + 10(x - 2) - 1, 

identically in x. Hence —1 is the remainder obtained when the given polynomial 
x 3 — 2x — 5 is divided by x — 2. By inspection, the quotient Q is equal to 

(x-2) 2 + 6(x-2) + 10. 

Hence 10 is the remainder obtained when Q is divided by x — 2. The new 
quotient is equal to (x — 2) + 6, and another division gives the remainder 6. 
Hence to find the coefficients 6, 10, —1 of the terms following p 3 in the trans- 
formed equation (2), we have only to divide the given polynomial x 3 — 2x — 5 
by x — 2, the quotient Q by x — 2, etc., and take the remainders in reverse 
order. However, when this work is performed by synthetic division (§15) as 



1 W. G. Horner, London Philosophical Transactions, 1819. Earlier (1804) by P. Ruffini. 
See Bulletin American Math. Society, May, 1911. 
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tabulated below, no reversal of order is necessary, since the coefficients then 
appear on the page in their desired order. 



1 





-2 


-5 




2 


4 


4 


1 


2 


2 


-1 




2 


8 




1 


4 
2 


10 




1 


6 





Thus 1, 6, 10, —1 are the coefficients of the desired equation (2). 

To obtain an approximation to the decimal p, we ignore for the moment 
the terms involving p 3 and p 2 ; then by 10p—l = 0, p = 0.1. But this value is too 
large since the terms ignored are all positive. For p = 0.09, the polynomial in (2) 
is found to be negative, while for p = 0.1 it was just seen to be positive. Hence 
p = 0.09 + h, where h is of the denomination thousandths. The coefficients 1, 
6.27, ... of the transformed equation for h appear in heavy type just under the 
first zigzag line in the following scheme: 



1 


6 


10 


-1 


| 0.09 




0.09 


0.5481 


0.949329 




1 


6.09 
0.09 


10.5481 
0.5562 


-0.050671 




1 


6.18 


11.1043 


0.05 




0.09 


0.025096 


0.044517584 


11.1 


1 


6.27 

0.004 


= 0.004 






1 


6.274 
0.004 


11.129396 
0.025112 


-0.006153416 




1 


6.278 


11.154508 






0.004 








1 


6.282 





Hence x = 2.094 + 1, where t is a root of 



i 3 + 6.282t 2 + 11.154508* - 0.006153416 = 0. 



By the last two terms, t is between 0.0005 and 0.0006. Then the value of 
C = t 3 + 6.282t 2 is found to lie between 0.00000157 and 0.00000227. Hence we 
may ignore C provided the constant term be reduced by an amount between 
these limits. Whichever of the two limits we use, we obtain the same dividend 
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below correct to 6 decimal places. 



11.154508 



0.006151 

5577 



0.000551= t 



574 
558 

16 
11 



Since the quotient is 0.0005+ , only two decimal places of the divisor are used, 
except to see by inspection how much is to be carried when making the first 
multiplication. Hence we mark a cross above the figure 5 in the hundredths 
place of the divisor and use only 11.15. Before making the multiplication by the 
second significant figure 5 of the quotient t, we mark a cross over the figure 1 in 
the tenths place of the divisor and hence use only 11.1. Thus x = 2.0945514+, 
with doubt only as to whether the last figure should be 4 or 5. 

If we require a greater number of decimal places, it is not necessary to go 
back and construct a new transformed equation from the equation in t. We 
have only to revise our preceding dividend on the basis of our present better 
value of t. We now know that t is between 0.000551 and 0.000552. To compute 
the new value of the correction C, in which we may evidently ignore t 3 , we use 
logarithms. 



>g 


5.51 = .74115 


log 


5.52 = .74194 


'g 


5.51 2 = 1.48230 


.-. log 


5.52 2 = 1.48388 


>g 


6.282 = .79810 


log 
log 


6.282 = .79810 


'g 


190.72 = 2.28040 


191.42 = 2.28198 



Hence C is between 0.000001907 and 0.000001915. Whichever of the two limits 
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we use, we obtain the same new dividend below correct to 8 decimal places. 
11.154508 I 0.00615150 I 0.00055148 



557725 

57425 
55773 

1652 
1115 



537 
446 

91 

89 



2 
Hence, finally, x = 2.094551482, with doubt only as to the last figure. 

EXERCISES 

(The number of transformations made by synthetic division should be about half 
the number of significant figures desired for a root.) 

By one of the methods in Chapter VI, isolate each real root of the following 
equations, and compute each real root to 5 decimal places. 

1. x 3 + 2x + 20 = 0. 2. x 3 + 3x 2 - 2x - 5 = 0. 

3. x 3 + x 2 -2x- 1 = 0. 4. x 4 + 4a; 3 -17.5a; 2 -18x + 58.5 = 0. 

5. x 4 - 11, 727a; + 40, 385 = 0. 6. x 3 = 10. 

Find to 7 decimal places all the real roots of 

7. x 3 + Ax 2 - 7 = 0. 8. x 3 - 7x - 7 = 0. 

Find to 8 decimal places 

9. The root between 2 and 3 of a; 3 — x — 9 = (make only 3 transformations). 

10. The real cube root of 7.976. 

11. The abscissa of the real point of intersection of the conies y = x 2 , xy + x + 
3y - 6 = 0. 
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1 2 . Find to 3 decimal places the abscissas of the points of intersection of x 2 + y 2 = 
9, y = x 2 — x. 

13. A sphere two feet in diameter is formed of a kind of wood a cubic foot of 
which weighs two-thirds as much as a cubic foot of water (i.e., the specific gravity of 
the wood is 2/3). Find to four significant figures the depth h to which the floating 
sphere will sink in water. 

Hints: The volume of a sphere of radius r is n^r 3 . Hence our sphere whose radius 
is 1 foot weighs as much as gvr • 3 cubic feet of water. The volume of the submerged 
portion of the sphere is 7rh 2 (r — ^h) cubic feet. Since this is also the volume of the 
displaced water, its value for r = 1 must equal in ■ I. Hence h 3 — 3h 2 + 1 = 0. 



14. If the specific gravity of cork is 1/4, find to four significant figures how far a 
cork sphere two feet in diameter will sink in water. 

15. Compute cos 20° to four decimal places by use of 



cos 3,4 = 4 cos 3 A- 3 cos ,4, cos 60° = \. 

16. Three intersecting edges of a rectangular parallelopiped are of lengths 6, 8, 
and 10 feet. If the volume is increased by 300 cubic feet by equal elongations of the 
edges, find the elongation to three decimal places. 

17. Given that the volume of a right circular cylinder is an and the total area of 
its surface is 2/37T, prove that the radius r of its base is a root of r 3 — fir + a = 0. 
If a = 56, (3 = 28, find to four decimal places the two positive roots r. The 
corresponding altitude is a/r 2 . 

18. What rate of interest is implied in an offer to sell a house for $2700 cash, or 
in annual installments each of $1000 payable 1,2, and 3 years from date? 

Hint: The amount of $2700 with interest for 3 years should be equal to the sum of 
the first payment with interest for 2 years, the amount of the second payment with 
interest for 1 year, and the third payment. Hence if r is the rate of interest and we 
write x for 1 + r, we have 

2700x 3 = 1000a; 2 + 1000a; + 1000. 

19. Find the rate of interest implied in an offer to sell a house for $3500 cash, or 
in annual installments each of $1000 payable 1, 2, 3, and 4 years from date. 

20. Find the rate of interest implied in an offer to sell a house for $3500 cash, or 
$4000 payable in annual installments each of $1000, the first payable now. 
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75. Newton's Method. Prior to 1676, Newton 2 had already found the 
root between 2 and 3 of equation (1). He replaced x by 2 + p and obtained (2). 
Since p is a decimal, he neglected the terms in p 3 and p 2 , and hence obtained 
p = 0.1, approximately. Replacing p by 0.1 + q in (2), he obtained 

q :i + 6.3<? 2 + 11.23q + 0.061 = 0. 

Dividing —0.061 by 11.23, he obtained —0.0054 as the approximate value of q. 
Neglecting g 3 and replacing q by —0.0054 + r, he obtained 

6.3r 2 + 11.16196r + 0.000541708 = 0. 

Dropping 6.3r 2 , he found r and hence 

x = 2 + 0.1 - 0.0054 - 0.00004853 = 2.09455147, 

of which all figures but the last are correct (§74). But the method will not 
often lead so quickly to so accurate a value of the root. 

Newton used the close approximation 0.1 to p, in spite of the fact that this 
value exceeds the root p and hence led to a negative correction at the next 
step. This is in contrast with Horner's method in which each correction is 
positive, so that each approximation must be chosen less than the root, as 0.09 
for p. 

Newton's method may be presented in the following general form, which 
is applicable to any equation f(x) = 0, whether f(x) is a polynomial or not. 
Given an approximate value a of a real root, we can usually find a closer 
approximation a + h to the root by neglecting the powers h 2 , /i 3 ,... of the 
small number h in Taylor's formula (§56) 

f(a + h) = f(a) + f'(a)h + f"(a)^- + --- 
and hence by taking 

-f(a) 



f(a) + f'(a)h = 0, h 



f'(a) 



We then repeat the process with a\ = a + h in place of the former a. 
Thus in Newton's example, f(x) = x 3 — 2x — 5, we have, for a = 2, 

h = TW = w> «i = « + * = 2.i, 

-/(2.1) -0.061 

hi = „,; / = = -0.0054 .... 

/'(2.1) 11.23 



2 Isaac Newton, Opuscula, I, 1794, p. 10, p. 37. 
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76. Graphical Discussion of Newton's Method. Using rectangular 
coordinates, consider the graph of y = f(x) and the point P on it with the 
abscissa OQ = a (Fig. 22). Let the tangent at P meet the x-axis at T and let 





Fig. 22 Fig. 23 

the graph meet the x-axis at 5. Take h = QT, the subtangent. Then 



QP = f(a), 



f'(a) = tan XTP 



"/(a) 



h 



/'(a) 



In the graph in Fig. 22, OT = a + h is a better approximation to the root OS 
than OQ = a. The next step (indicated by dotted lines) gives a still better 
approximation OT\. 

If, however, we had begun with the abscissa a of a point P\ in Fig. 22 
near a bend point, the subtangent would be very large and the method would 
probably fail to give a better approximation. Failure is certain if we use a 
point P2 such that a single bend point lies between it and S. 

We are concerned with the approximation to a root previously isolated as 
the only real root between two given numbers a and (3. These should be chosen 
so nearly equal that f'(x) = has no real root between a and (3, and hence 
f(x) = y has no bend point between a and (3. Further, if f"(x) = has a root 
between our limits, our graph will have an inflexion point with an abscissa 
between a and (3, and the method will likely fail (Fig. 23). 

Let, therefore, neither f'(x) nor f"(x) vanish between a and (3. Since /" 
preserves its sign in the interval from a to f3, while / changes in sign, /" and / 
will have the same sign for one end point. According as the abscissa of this 
point is a or /3, we take a = a or a = (3 for the first step of Newton's process. 
In fact, the tangent at one of the end points meets the x-axis at a point T with 
an abscissa within the interval from a to (3. If f(x) is positive in the interval, 
so that the tangent makes an acute angle with the x-axis, we have Fig. 24 or 
Fig. 25; if /' is negative, Fig. 26 or Fig. 22. 
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a T 





Fig. 24 



Fig. 25 



Fig. 26 



In Newton's example, the graph between the points with the abscissas a = 2 and 
[3 = 3 is of the type in Fig. 24, but more nearly like a vertical straight line. In view 
of this feature of the graph, we may safely take a = a, as did Newton, although our 
general procedure would be to take a = (3. The next step, however, accords with our 
present process; we have a = 2, (3 = 2.1 in Fig. 24 and hence we now take a = (3, 
getting 

°- 061 0.0054 



11.23 



as the subtangent, and hence 2.1 — 0.0054 as the approximate root. 

If we have secured (as in Fig. 24 or Fig. 26) a better upper limit to the 
root than /3, we may take the abscissa c of the intersection of the chord AB 
with the x-axis as a better lower limit than a. By similar triangles, 

-f{a) :c-a= f{f3) : (3 - c, 



whence 
(3) 






This method of finding the value of c intermediate to a and (3 is called the 
method of interpolation (regula falsi). 
In Newton's example, a = 2, (3 = 2.1, 



/(«) 



/(/?) = 0.061, 



c = 2.0942. 



The advantage of having c at each step is that we know a close limit of the 
error made in the approximation to the root. 

We may combine the various possible cases discussed into one: 
If f( x ) — has a single real root between a and j3, and f'{x) = 0, f"(x) = 
have no real root between a and (3, and if we designate by (3 that one of the 
numbers a and [3 for which /(/?) and /"(/?) have the same sign, then the root 
lies in the narrower interval from c to (3 — f{f3)/f'{/3), where c is given by (3). 
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It is possible to prove 3 this theorem algebraically and to show that by 
repeated applications of it we can obtain two limits a! , (3 1 between which the 
root lies, such that a! ' —(3' is numerically less than any assigned positive number. 
Hence the root can be found in this manner to any desired accuracy. 

Example. f(x) = x 3 - 2x 2 - 2, a = 2\, (3 = l\. Then 

/(«) = -!, /(/?) = §■ 

Neither of the roots 0, 4/3 of f'(x) = lies between a and f3, so that f(x) = has 
a single real root between these limits (§65). Nor is the root =■ of f"(x) = within 
these limits. The conditions of the theorem are therefore satisfied. For a < x < (3, 
the graph is of the type in Fig. 24. We find that approximately 

c= § =2.3487, A = /3-^| = 2.3714, 

For x = 2.3593, f{x) = —0.00003. We therefore have the root to four decimal places. 
For a = 2.3593, 

f'(a) = 7.2620, a - ^\ = 2.3593041, 

which is the value of the root correct to 7 decimal places. We at once verify that the 
result is greater than the root in view of our work and Fig. 24, while if we change 
the final digit from 1 to 0, f{x) is negative. 



EXERCISES 

1. For f(x) = x 4 + x 3 — 3x 2 — x — 4, show by Descartes' rule of signs that 
f'(x) = and f"(x) = each have a single positive root and that neither has a root 
between 1 and 2. Which of the values 1 and 2 should be taken as (31 

2. When seeking a root between 2 and 3 of x 3 — x — 9 = 0, which value should 
be taken as (31 



3 Weber's Algebra, 2d ed., I, pp. 380-382; Kleines Lehrbuch der Algebra, 1912, p. 163. 
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77. Systematic Computation of Roots by Newton's Method. By 

way of illustration we shall compute to 7 decimal places a positive root of 

f[x) = X A + x 3 - 3x 2 - x - 4 = 0. 

Since /(l) = —6, /(2) = 6, there is a real root between 1 and 2. Since 

f(x) = 4x 3 + 3x 2 -6x-l, /'(1) = 0, /'(2) = 31, 

the graph of y = f(x) is approximately horizontal near (1,-6) and approxi- 
mately vertical near (2,6). Hence the root is much nearer to 2 than to 1. Thus 
in applying Newton's method we employ a = 2 as the first approximation to 
the root. The correction h is then 

-/(2) -6 

- -0.2... . 



/'(2) 31 

The work of performing the substitutions x = 2 + d, d = —0.2 + e, . . . , to find 
the transformed equations satisfied by d, e, . . . , is done by synthetic division, 
exactly as in Horner's method, except that some of the multipliers are now 
negative: see Table 1, page 107. 

The root is 2 - 0.2 - 0.04 + 0.000302 = 1.760302, in which the last figure 
is in slight doubt. Indeed, it can be proved that if the final fraction g, when 
expressed as a decimal, has k zeros between the decimal point and the first 
significant figure, the division may be safely carried to 2k decimal places. In 
our example k = 3, so that we retained 6 decimal places in g. 

To proceed independently of this rule, we note that g is obviously between 
0.00030 and 0.00031. Then the value of g 4 + 8.04y 3 + 20.8656y 2 is found to lie 
between 0.000001878 and 0.000002006. Whichever of these limits we use as a 
correction by which to reduce the constant term, we obtain the same dividend 
below correct to 6 decimal places. 



xx xx 



19.539904 | 0.005896 | 0.0003017 
005862 



34 
20 

14 
14 



Hence the root is 1.7603017 to 7 decimal places. 
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8.2 
-0.04 



8.04 



-3 

6 



-0.3264 



-0.860544 



-4 

10 



1 


3 


3 


5 


6 




2 


10 


26 




1 


5 
2 


13 
14 


31 




1 


7 


27 






2 









-0.2 



-0.2 


-1.76 


-5.048 


-5.1904 


8.8 
-0.2 


25.24 

-1.72 


25.952 
-4.704 


0.8096 


8.6 
-0.2 


23.52 

-1.68 


21.248 




8.4 
-0.2 


21.84 





-0.81549824 



-0.8096 

21.248 

= -0.04 



8.16 


21.5136 


20.387456 


-0.00589824 


-0.04 


-0.3248 


-0.847552 




8.12 


21.1888 


19.539904 




-0.04 


-0.3232 




8.08 


20.8656 




-0.04 




0. 


005898 



19.54 



.000302 



Table 1. 
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EXERCISES 

1. Find to 8 decimal places the root between 2 and 3 of x 3 — x — 9 = 0. 

2. Find to 7 decimal places the root between 2 and 3 of x 3 — 2x 2 — 2 = 0. 

3. Find the real cube root of 7.976 to 6 decimal places. 

4. Explain by Taylor's expansion of /(2 + d) why the values of 

/(2), /(2), ±/"( 2 ), ^r(2), ^l^r't 2 ) 

are in reverse order the coefficients of the transformed equation 

d 4 + 9d 3 + 27d 2 + 31d + 6 = 0, 
obtained in the Example in the text, and printed in heavy type. 

5. The method commonly used to find the positive square root of n by a com- 
puting machine consists in dividing n by an assumed approximate value a of the 
square root and taking half the sum of a and the quotient as a better approxima- 
tion. Show that the latter agrees with the value of a + h given by applying Newton's 
method to f(x) = x 2 — n. 

78. Newton's Method for Functions not Polynomials. 

EXAMPLE 1. Find the angle x at the center of a circle subtended by a chord 
which cuts off a segment whose area is one-eighth of that of the circle. 

Solution. If x is measured in radians and if r is the radius, the area of the 
segment is equal to the left member of 

\r (x — sinx) = |7rr , 

whence 

x — sinx = |7r. 

By means of a graph of y = sinx and the straight line represented by y = x — ^7r, 
we see that the abscissa of their point of intersection is approximately 1.78 radians 
or 102°. Thus a = 102° is a first approximation to the root of 

f(x) = x — sinx — j7T = 0. 

By Newton's method a better approximation is a + ft,, where 4 

-/(a) -a + sin a + \tt 



h 



/'(a) 1 — cos a 



4 The derivative of sinx is cosx. We need the limit of 

sin(x + 2k) — sinx _ 2 cos \{2x + 2k) sin \{2k) _ cos(x + fc)sinfc 

as 2k approaches zero. Since the ratio of sin A; to k approaches 1, the limit is cosx. 
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sin 102°= 0.9781 cos 102° = -0.2079 

|(3.1416)= 0.7854 1 - cos 102° = 1.2079 

1.7635 

-0.0167 
102° = 1.7802 radians h = 1 2Q79 = -0.0138 

~ °- 0167 oi = a + h = 1.7664 

-f(ai) -1.7664 + 0.9809 + 0.7854 



-0.0001. 



f'(ai) 1.1944 

Hence x = a\ + h\ = 1.7663 radians, or 101°12'. 

Example 2. 5 Solve x — logx = 7, the logarithm being to base 10. 

Solution. Evidently x exceeds 7 by a positive decimal which is the value 
of logx. Hence in a table of common logarithms, we seek a number x between 7 
and 8 whose logarithm coincides approximately with the decimal part of x. We read 
off the values in the second column. 



7.897 
7.898 



logx 



0.89746 
0.89752 



log a: 



6.99954 
7.00048 



By the final column the ratio of interpolation is 46/94. Hence x = 7.8975 to four 
decimal places. 

Example 3. Solve 2x — logx = 7, the logarithm being to base 10. 

Solution. Evidently x is a little less than 4. A table of common logarithms 
shows at once that a fair approximation to x is a = 3.8. Write 

f(x) = 2x - log x - 7, log x = M log e x, M = 0.4343. 

By calculus, the derivative of log e x is 1/x. Hence 

fix) = 2 , f(a) = 2 - 0.1143 = 1.8857, 

x 

/(o) = 0.6 - log 3.8 = 0.6 - 0.57978 = 0.02022, 

-ft = 4M = 0.0107, oi = a + h = 3.7893, 

/(a) 

/(ai) = 0.000041, 7(3.7892) = -0.000148. 

— x 0.0001 = 0.000078, x = 3.789278. 

189 

All figures of x are correct as shown by Vega's table of logarithms to 10 places. 



5 This Ex. 2, which should be contrasted with Ex. 3, is solved by interpolation since that 
method is simpler than Newton's method in this special case. 
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EXERCISES 

Find the angle x at the center of a circle subtended by a chord which cuts off a 
segment whose ratio to the circle is 

1 1 2 5 

i. 4 . z. g . 

When the logarithms are to base 10, 

3. Solve 2x — logx = 9. 4. Solve 3x — logx = 9. 

5. Find the angle just > 15° for which ^ sinx + sin2x = 0.64. 

6. Find the angle just > 72° for which x — ^ sinx = j7r. 

7. Find all solutions of Ex. 5 by replacing sin2x by 2 sin x cos x, squaring, and 
solving the quartic equation for cosx. 

8. Solve similarly sinx + sin2x =1.2. 

9. Find x to 6 decimal places in sinx = x — 2. 
10. Find x to 5 decimal places in x = 31og e x. 

79. Imaginary Roots. To find the imaginary roots x + yi of an equation 
f(z) = with real coefficients, expand f(x + yi) by Taylor's theorem; we get 

/(*) + f'(x)yi - f"(x)^ - /'"(*) j^ + • • • = 0. 



Since x and y are to be real, and y / 0, 

(4) 



f'(x)-f"(x) T ^ + f( 5 \x)^-... = 0. 

In the Example and Exercises below, f(z) is of degree 4 or less. Then the 
second equation (4) is linear in y 2 . Substituting the resulting value of y 1 in 
the first equation (4), we obtain an equation E(x) = 0, whose real roots may 
be found by one of the preceding methods. If the degree of f(z) exceeds 4, we 
may find E(x) = by eliminating y 2 between the two equations (4) by one of 
the methods to be explained in Chapter X. 

Example. For f(z) = z A — z + 1, equations (4) are 

x 4 - x + 1 - 6x 2 y 2 + / = 0, 4x 3 - 1 - 4xy 2 = 0. 



579.1 IMAGINARY ROOTS 111 



Thus 



y 2 = x 2 - — , -Ax 6 + x 2 + — = 0. 

Ax 16 



The cubic equation in x has the single real root 

x 2 = 0.528727, x = ±0.72714. 

Then y 2 = 0.184912 or 0.87254, and 

z = x + yi = 0.72714 ± 0.4300H, -0.72714 ± 0.934091 



EXERCISES 

Find the imaginary roots of 

1. z 3 - 2z - 5 = 0. 2. 28z 3 + 9z 2 - 1 = 0. 

3. z A - 3z 2 -6z = 2. 4. z A - 4z 3 + llz 2 - Uz + 10 = 0. 

5. z A - Az 3 + 9z 2 - Wz + 20 = 0. Hint: 

E(x) = x(x - 2) (16a; 4 - 6Ax 3 + 136a; 2 - 144x + 65) = 0, 

and the last factor becomes (w 2 + l)(w 2 + 9) for 2x = w + 2. 

Note. If we know a real root r of a cubic equation f{z) = 0, we may remove 
the factor z — r and solve the resulting quadratic equation. When, as usual, r 
involves several decimal places, this method is laborious and unsatisfactory. But 
we may utilize a device, explained in the author's Elementary Theory of Equations, 
pp. 119-121, §§6, 7. As there explained, a similar device may be used when we know 
two real roots of a quartic equation. 
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MISCELLANEOUS EXERCISES 

(Give answers to 6 decimal places, unless the contrary is stated.) 

1. What arc of a circle is double its chord? 

2. What arc of a circle is double the distance from the center of the circle to 
the chord of the arc? 

3. If .A and B are the points of contact of two tangents to a circle of radius 
unity from a point P without it, and if arc AB is equal to PA, find the length of 
the arc. 

4. Find the angle at the center of a circle of a sector which is bisected by its 
chord. 

5. Find the radius of the smallest hollow iron sphere, with air exhausted, which 
will float in water if its shell is 1 inch thick and the specific gravity of iron is 7.5. 

6. From one end of a diameter of a circle draw a chord which bisects the semi- 
circle. 

7. The equation xtanx = c occurs in the theory of vibrating strings. Its ap- 
proximate solutions may be found from the graphs y = cot x, y = x/c. Find x when 
c= 1. 

8. The equation tanx = x occurs in the study of the vibrations of air in a 
spherical cavity. From an approximate solution x\ = 1.5n, we obtain successively 
better approximations X2 = tan -1 x\ = 1.43347T, X3 = tan -1 X2, • • • • Find the first 
three solutions to 4 decimal places. 

9. Find to 3 decimal places the first five solutions of 

2x 
tanx 



2-x 2 ' 
which occurs in the theory of vibrations in a conical pipe. 

10. 4rx 3 — (3x — l) 2 = arises in the study of the isothermals of a gas. Find its 
roots when (i) r = 0.002 and (ii) r = 0.99. 

11. Solve x x = 100. 

12. Solve x = 10 log x. 

13. Solve x + log x = x log x. 

14. Solve Kepler's equation M = x - esinx when M = 332°28'54.8", e = 
14°3'20 // . 
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15. In what time would a sum of money at 6% interest compounded annually 
amount to as much as the same sum at simple interest at 8%? 

16. In a semicircle of diameter x is inscribed a quadrilateral with sides a, b, c, x; 
then x 3 — (a 2 + b 2 + c 2 )x — 2abc = (I. Newton). Given a = 2, b = 3, c = 4, find x. 

17. What rate of interest is implied in an offer to sell a house for $9000 cash, or 
$1000 down and $3000 at the end of each year for three years? 



CHAPTER VIII 

Determinants; Systems of Linear Equations 

80. Solution of Two Linear Equations by Determinants of Order 2. 

Assume that there is a pair of numbers x and y for which 



(1) 



a\x + b\y = ki, 

a 2 x + b 2 y = k 2 . 



Multiply the members of the first equation by b 2 and those of the second 
equation by — b\, and add the resulting equations. We get 

(ai&2 - a 2 b\)x = k\b 2 - k 2 b\. 
Employing the respective multipliers — a 2 and a\, we get 

(ai&2 - a 2 b\)y = a\k 2 - a 2 k\. 
The common multiplier of x and y is 
(2) a\b 2 -a 2 bi, 

and is denoted by the symbol 

(2') 



oi &i 

a 2 b 2 



which is called a determinant of the second order, and also called the deter- 
minant of the coefficients of x and y in equations (1). The results above may 
now be written in the form 



(3) 



«i h 
a 2 b 2 



k\ h 

k 2 b 2 



oi 6i 

a 2 b 2 



y 



a\ k\ 
a 2 k 2 



We shall call k\ and k 2 the known terms of our equations (1). Hence, if D is the 
determinant of the coefficients of the unknowns, the product of D by any one 
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of the unknowns is equal to the determinant obtained from D by substituting 
the known terms in place of the coefficients of that unknown. 
If D / 0, relations (3) uniquely determine values of x and y: 



x 



k\bi - k 2 bi 



y 



a \k<i - a 2 k\ 



D D 

and these values satisfy equations (1); for example, 

(ai&2 - a2b\)ki 



a\x + b\y 



D 



h- 



Hence our equations (1) have been solved by determinants when D / 0. We 
shall treat in §96 the more troublesome case in which D = 0. 



Example. For 2x - 3y 



2 


-3 


x = 


-4 


-3 


6 


-2 




2 


-2 






9 


-4 




1 


iy = 


6 


2 



4, 6x — 2y = 2, we have 
Ux = 14, 



28, 



y = 2. 



EXERCISES 

Solve by determinants the following systems of equations: 



1. 8x - y = 34, 

x + 8y = 53. 



2. 3x + 4y = 10, 

4x+ y = 9. 



3. ax + by 



bx — ay = a&. 



81. Solution of Three Linear Equations by Determinants of Or- 
der 3. Consider a system of three linear equations 



(4) 



a\x + b\y + c\z = fci, 
a 2 a; + 62 2/ + ^z = k 2 , 
a 3 x + b 3 y + czz = fc 3 . 



Multiply the members of the first, second and third equations by 
(5) b 2 C3-b 3 C2, &3C1-61C3, &1C2-&2C1, 

respectively, and add the resulting equations. We obtain an equation in which 
the coefficients of y and z are found to be zero, while the coefficient of x is 



(6) 



G4&2C3 - aibsC2 + a2&3Ci - 02^3 + 03^2 - «3^2Ci- 



382.1 



SIGNS OF THE TERMS OF A DETERMINANT 
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Such an expression is called a determinant of the third order and denoted by 
the symbol 



(6') 



oi 


h 


c\ 


(l 2 


b 2 


C'2 


a 3 


h 


c;i 



The nine numbers a\,...,c% are called the elements of the determinant. 
In the symbol these elements lie in three (horizontal) rows, and also in three 
(vertical) columns. Thus a 2 , b 2 , c 2 are the elements of the second row, while 
the three c's are the elements of the third column. 

The equation (free of y and z), obtained above, may now be written as 



a\ 


h 


c\ 




a 2 


h 


C'l 


X 


as 


bs 


CS 





h 


h 


Cl 


h 


b 2 


C'l 


h 


h 


c.'i 



since the right member was the sum of the products of the expressions (5) by 
kl, k 2 , ks, and hence may be derived from (6) by replacing the a's by the /c's. 
Thus the theorem of §80 holds here as regards the unknown x. We shall later 
prove, without the laborious computations just employed, that the theorem 
holds for all three unknowns. 



82. The Signs of the Terms of a Determinant of Order 3. In the 

six terms of our determinant (6), the letters a, b, c were always written in this 
sequence, while the subscripts are the six possible arrangements of the numbers 
1, 2, 3. The first term a\b 2 cs shall be called the diagonal term, since it is the 
product of the elements in the main diagonal running from the upper left-hand 
corner to the lower right-hand corner of the symbol (6 ; ) for the determinant. 
The subscripts in the term —016302 are derived from those of the diagonal 
term by interchanging 2 and 3, and the minus sign is to be associated with the 
fact that an odd number (here one) of interchanges of subscripts were used. 
To obtain the arrangement 2, 3, 1 of the subscripts in the term +026301 from 
the natural order 1, 2, 3 (in the diagonal term), we may first interchange 1 
and 2, obtaining the arrangement 2, 1, 3, and then interchange 1 and 3; an 
even number (two) of interchanges of subscripts were used and the sign of the 
term is plus. 

While the arrangement 1, 3, 2 was obtained from 1, 2, 3 by one interchange 
(2,3), we may obtain it by applying in succession the three interchanges (1,2), 
(1, 3), (1, 2), and in many new ways. To show that the number of interchanges 
which will produce the final arrangement 1, 3, 2 is odd in every case, note that 
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each of the three possible interchanges, viz., (1,2), (1,3), and (2,3), changes 
the sign of the product 

P = (x\- x 2 )(x\ - x 3 )(x 2 - x 3 ), 

where the x's are arbitrary variables. Thus a succession of k interchanges 
yields P or — P according as k is even or odd. Starting with the arrangement 
1, 2, 3 and applying k successive interchanges, suppose that we obtain the final 
arrangement 1, 3, 2. But if in P we replace the subscripts 1, 2, 3 by 1, 3, 2, 
respectively, i.e., if we interchange 2 and 3, we obtain —P. Hence k is odd. We 
have therefore proved the following rule of signs: 

Although the arrangement r, s, t of the subscripts in any term ±a r b s ct 
of the determinant may be obtained from the arrangement 1, 2, 3 by various 
successions of interchanges, the number of these interchanges is either always 
an even number and then the sign of the term is plus or always an odd number 
and then the sign of the term is minus. 

EXERCISES 

Apply the rule of signs to all terms of 

1. Determinant (6). 2. Determinant ai&2 — 02^1- 

83. Number of Interchanges always Even or always Odd. We now 

extend the result in §82 to the case of n variables x\,... ,x n . The product of 
all of their differences xj — Xj (i < j) is 

P=(xi- x 2 )(xi - x 3 ) • • -(xi - X n ) 

■ (X2 ~Xs) ■■■(X2 -X n ) 

\X n —\ — X n ). 

Interchange any two subscripts i and j. The factors which involve neither i 
nor j are unaltered. The factor (xi — Xj) involving both is changed in sign. 
The remaining factors may be paired to form the products 

H x i ~ x k )(xj - x k ) (k = l,...,n; k^i,k^j). 

Such a product is unaltered. Hence P is changed in sign. 

Suppose that an arrangement ii,i 2 , . . . ,i n can be obtained from 1,2,..., 
n by using m successive interchanges and also by t successive interchanges. 
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Make these interchanges on the subscripts in P; the resulting functions are 
equal to (— l) m P and (— 1)*P, respectively. But the resulting functions are 
identical since either can be obtained at one step from P by replacing the 
subscript 1 by i\, 2 by i 2 , ■ ■ ■ , n by i n . Hence 

(-l) m P = (-l) i P, 

so that m and t are both even or both odd. 

Thus if the same arrangement is derived from 1, 2,...,n by m successive 
interchanges as by t successive interchanges, then m and t are both even or 
both odd. 



84. Definition of a Determinant of Order n. We define a determinant 
of order 4 to be 



(7) 



oi 


h 


c\ 


dl 


a 2 


b 2 


C'l 


d 2 


a 3 


h 


C3 


d 3 


a 4 


b± 


CA 


G?4 



S jr±a q b r c s dt, 
(24) 



where q, r, s, t is any one of the 24 arrangements of 1, 2, 3, 4, and the sign of the 
corresponding term is + or — according as an even or odd number of inter- 
changes are needed to derive this arrangement q, r, s, t from 1, 2, 3, 4. Although 
different numbers of interchanges will produce the same arrangement q, r, s, t 
from 1, 2, 3, 4, these numbers are all even or all odd, as just proved, so that the 
sign is fully determined. 

We have seen that the analogous definitions of determinants of orders 2 
and 3 lead to our earlier expressions (2) and (6). 

We will have no difficulty in extending the definition to a determinant of 
general order n as soon as we decide upon a proper notation for the n 2 elements. 
The subscripts 1,2, ... ,n may be used as before to specify the rows. But the 
alphabet does not contain n letters with which to specify the columns. The use 
of e', e", . . . , e^ n > for this purpose would conflict with the notation for derivatives 
and besides be very awkward when exponents are used. It is customary in 
mathematical journals and scientific books (a custom not always followed in 
introductory text books, to the distinct disadvantage of the reader) to denote 
the n letters used to distinguish the n columns by e\, e 2 , ■ ■ ■ , e n (or some other 
letter with the same subscripts) and to prefix (but see §85) such a subscript 
by the new subscript indicating the row. The symbol for the determinant is 
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therefore 



(8) 



D 



en ei2 

e21 e22 



ei n 

e2n 



e n l e n2 



By definition this shall mean the sum of the n(n — 1) • • • 2 • 1 terms 



(9) 



{-iye hl e i2 2---e 



i n n 



in which i\, 12, . . . , i n is an arrangement of 1, 2, . . . , n, derived from 1, 2, . . . , n by 
i interchanges. Any term (9) of the determinant (8) is, apart from sign, the 
product of n factors, one and only one from each column, and one and only 
one from each row. 

For example, if we take n = 4 and write a,j, bj, cj, dj for eji, ej2, ej%, ej4, the 
symbol (8) becomes (7) and the general term (9) becomes the general term 
(—lYai 1 bi 2 Ci 3 di 4 of the second member of (7). 



EXERCISES 

1. Find the six terms involving a<i in the determinant (7). 

2. What are the signs of 036502^164, 0564030^261 in a determinant of order five? 

3. Show that the arrangement 4,1,3,2 may be obtained from 1,2,3,4 by use of 
the two successive interchanges (1,4), (1,2), and also by use of the four successive 
interchanges (1,4), (1,3), (1,2), (2,3). 

4. Write out the six terms of (8) for n = 3, rearrange the factors of each term so 
that the new first subscripts shall be in the order 1,2,3, and verify that the resulting 
six terms are those of the determinant D' in S85 for n = 3. 



85. Interchange of Rows and Columns. Any determinant is not al- 
tered in value if in its symbol we replace the elements of the first, second, 
. . . , nth rows by the elements which formerly appeared in the same order in the 
first, second, . . . , nth columns, or briefly if we interchange the corresponding 
rows and columns. For example, 



a b 
c d 



ad — be 



a c 
b d\ 
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We are to prove that the determinant D given by (8) is equal to 



D' 



en e 2 i • • • e n i 
ei2 e22 • • • e n 2 



ein e2 f 



If we give to Z)' a more familiar aspect by writing e^ = a^ for each element 
so that, as in (8), the row subscript precedes instead of follows the column 
subscript, the definition of the determinant in terms of the a's gives D' in 
terms of the e's as the sum of all expressions 

(- 1 Y e lk 1 e 2k 2 '" e nk n -> 

in which k\, k<i, ■ ■ ■ , k n is an arrangement of 1,2, ... ,n, derived from the latter 
sequence by i interchanges. 

As for the terms of D, without altering (9), we may rearrange its factors 
so that the first subscripts shall appear in the order 1,2, ... ,n, and obtain 

(- 1 Y e lk 1 e 2k 2 '" e nk n - 

This can be done by performing in reverse order the i successive interchanges of 
the letters e corresponding to the i successive interchanges which were used to 
derive the arrangement i\,%2, ■ ■ ■ ,in of the first subscripts from the arrangement 
1, 2, . . . , n. Thus the new second subscripts k\, . . . , k n are derived from the old 
second subscripts l,...,n by i interchanges. The resulting signed product is 
therefore a term of D' . Hence D = D' . 



86. Interchange of Two Columns. A determinant is merely changed 
in sign by the interchange of any two of its columns. For example, 



D 



a b 
c d 



ad — be, 



A 



\b a\ 
\d c\ 



be — ad 



-D. 



Let A be the determinant derived from (8) by the interchange of the rth 
and sth columns. The terms of A are therefore obtained from the terms (9) 
of D by interchanging r and s in the series of second subscripts. Interchange 
the rth and sth letters e to restore the second subscripts to their natural order. 
Since the first subscripts have undergone an interchange, the negative of any 
term of A is a term of D, and A = —D. 
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87. Interchange of Two Rows. A determinant D is merely changed in 
sign by the interchange of any two rows. 

Let A be the determinant obtained from D by interchanging the rth and 
sth rows. By interchanging the rows and columns in D and in A, we get two 
determinants D' and A', either of which may be derived from the other by the 
interchange of the rth and sth columns. Hence, by §§85, 86, 



A = A' = -D' 



-D. 



88. Two Rows or Two Columns Alike. A determinant is zero if any 
two of its rows or any two of its columns are alike. 

For, by the interchange of the two like rows or two like columns, the deter- 
minant is evidently unaltered, and yet must change in sign by §§86, 87. Hence 
D = -D, D = 0. 

EXERCISES 



1. Prove that the equation of the straight line determined by the two distinct 
points (xi,yi) and (0:2,3/2) is 



0. 



2. Show that 



X 


y 


1 


Xl 


V\ 


1 


x 2 


2/2 


1 



a 1 


&i 


c\ 




0,2 


b,_ 


C-2 


= 


a 3 


h 


C3 





«2 


C-2 


t>2 




Ol 


Cl 


bi 


= 


a 3 


C3 


b ?> 





a 3 


Ol 


«2 


b 3 


&i 


6 2 


C3 


Cl 


c 2 



By use of the Factor Theorem (§14) and the diagonal term, prove that 



a 

„2 



1 

b 
b 2 



(6 — a)(c — a)(c — b). 



1 


1 


1 


Xl 


^2 


2-n 


x\ 


x 2 


x n 


xr 1 


x 2 


™n— 1 



n ( x *- x i)- 



i>j 



This is known as the determinant of Vandermonde, who discussed it in 1770. The 
symbol on the right means the product of all factors of the type indicated. 
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5. Prove that a skew-symmetric determinant of odd order is zero: 














a 


b 


c 


d 





a 


b 




— a 





e 


/ 


5 


a 





c 


= 0, 


-b 


— e 





h 


i 


b 


— c 







—c 


-/ 


-ft 





fc 










-d 


-9 


-J 


-jfe 






89. Minors. The determinant of order n — 1 obtained by erasing (or 
covering up) the row and column crossing at a given element of a determinant 
of order n is called the minor of that element. 

For example, in the determinant (6') of order 3, the minors of b\, b 2 , 63 are 
respectively 

Bi 
Again, (6 ; ) is the minor of d^ in the determinant of order 4 given by (7). 



a-2 


C'2 


, B 2 = 


a- L 


c\ 


, B 3 = 


ai 


Cl 


0.3 


C3 




a 3 


C3 




0-2 


C2 



90. Expansion According to the Elements of a Row or Column. 



In 



(6') 



D 



(ll 


h 


Cl 


a 2 


h 


C'l 


a 3 


h 


c;i 



denote the minor of any element by the corresponding capital letter, so that 
b\ has the minor B\, 63 has the minor S3, etc., as in §89. We shall prove that 



D= a\Ai -b 1 B 1 + C1C1, 

D = -C12A2 + 62-B2 - C2C2, 
D = a 3 A :i - bsB :i + C3C3, 



D = a\Ai - a 2 A 2 + a 3 A 3 , 
D = -61S1 + b 2 B 2 - b 3 B 3 , 
D= ciCi - c 2 C 2 + C3C3. 



The three relations at the left (or right) are expressed in words by saying that 
a determinant D of the third order may be expanded according to the elements 
of the first, second or third row {or column). To obtain the expansion, we 
multiply each element of the row (or column) by the minor of the element, 
prefix the proper sign to the product, and add the signed products. The signs 
are alternately + and — , as in the diagram 



+ - + 
- + - 

+ - + 
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For example, by expansion according to the second column, 



1 4 5 




2 3 


2 3 


= -4 


3 9 


3 9 







-4x9 



-36. 



Similarly the value of the determinant (7) of order 4 may be found by expansion 
according to the elements of the fourth column: 



ch 



a 2 b 2 


('2 




03 h 


('■A 


+ d 2 


(24 64 


C 4 





01 


61 Cl 




03 


^3 c 3 


-Gf 3 


0,4 


64 C4 





ai b\ c\ 




a 2 b 2 c 2 


+ <i 4 


04 64 C4 





ai 


61 


Cl 


a 2 


62 


C2 


a 3 


&3 


C3 



We shall now prove that any determinant D of order n may be expanded 
according to the elements of any row or any column. 

Let Eij denote the minor of e^ in D, given by (8), so that E^ is obtained 
by erasing the ith row and jth column of D. 

(i) We first prove that 



(10) 



D = eii-En - e 2 iE 2 i + e 3 i£ , 3 i 



+ (-!)' 



enl-^nl 



so that D may be expanded according to the elements of its first column. 
By (9) the terms of D having the factor en are of the form 

(-iye u e i2 2---e inn , 

where 1,12, ... ,i n is an arrangement of 1,2, ... ,n, obtained from the latter by 
i interchanges, so that i 2 , . . . , i n is an arrangement of 2, . . . , n, derived from the 
latter by i interchanges. After removing from each term the common factor en 
and adding the quotients, we obtain a sum which, by definition, is the value of 
the determinant E\\ of order n — 1. Hence the terms of D having the factor en 
may all be combined into e\\E\\, which is the first part of (10). 

We next prove that the terms of D having the factor e 2 i may be combined 
into — e2iE 2 i, which is the second part of (10). For, if A be the determinant 
obtained from D by interchanging its first and second rows, the result just 
proved shows that the terms of A having the factor e 2 \ may be combined into 
the product of e 2 \ by the minor 



ei2 ei 3 ••• e Xn 

e 32 e33 ... &3n 



of e 2 i in A. Now this minor is identical with the minor E 2 \ of e 2 \ in D. But 
A = — D (§87). Hence the terms of D having the factor e 2 \ may be combined 
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into —e 2 \E 2 \. Similarly, the terms of D having the factor e%\ may be combined 
into ez\E%\, etc., as in (10). 

(ii) We next prove that D may be expanded according to the elements of 
its kth column (k > l): 



(11) 






Consider the determinant 5 derived from D by moving the kth column over 
the earlier columns until it becomes the new first column. Since this may be 
done by k — 1 interchanges of adjacent columns, 5 = (—l) k ~ 1 D. The minors 
of the elements e^, . . . , e n & in the first column of 5 are evidently the minors 
#lfc, ■ ■ ■ , E nk o f eifc, . . . , e nk in D. Hence, by (10), 



\n-l. 



8 = e.\k E \k ~ &2k E 2k -\ \- ("IT le nk E nk = / X~ l 



e jk^jk- 



Thus D = (—l) k 1 5 has the desired value (11). 

(Hi) Finally, D may be expanded according to the elements of its kth row: 

71 

D = J2(-l) J+k ekjE kj - 

i=i 

In fact, by Case (ii), the latter is the expansion of the equal determinant D' 
in §85 according to the elements of its kth column. 



91. Removal of Factors. A common factor of all of the elements of the 
same row or same column of a determinant may be divided out of the elements 
and placed as a factor before the new determinant. 

In other words, if all of the elements of a row or column are divided by n, 
the value of the determinant is divided by n. For example, 



nai nb\\ 
a-2 b2 



n 



02 bi 



a\ nb\ c\ 




a 2 nb 2 c 2 


= n 


a 3 n6 3 c 3 





ai b\ 


Cl 


a<i b 2 


c 2 


a3 ^3 


C3 



Proof is made by expanding the determinants according to the elements of 
the row or column in question and noting that the minors are the same for the 
two determinants. Thus the second equation is equivalent to 

-(nb 1 )B 1 + (n& 2 )# 2 ~ ( n h)B S = n(-b 1 B 1 + b 2 B 2 - b s B 3 ), 
where Bj denotes the minor of 6,- in the final determinant. 
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EXERCISES 



1. 



3 a 


36 3c 


5 a 


56 5c 


d 


e f 



2. 



2r 


I 


3r 


2 s 


rn 


3.s 


It 


n 


3* 



Expand by the shortest method and evaluate 





2 


7 3 




3. 


5 


9 8 









3 






a 


bed 


5. 


a 2 


6 2 c 2 d 2 


a 3 


6 3 c 3 d 3 




a 4 


6 4 


c 4 d 4 



4. 



5 7 





6 8 





3 9 


4 



abcd{a — 6) (a — c)(a — d)(b — c)(b — d){c — d). 



92. Sum of Determinants. A determinant having a\ + q\, 02 + 92 > • • • 
as i/ie elements of a column is equal to the sum of the determinant having a\, 
a,2, ■ ■ ■ as the elements of the corresponding column and the determinant having 
qi, g2,... as the elements of that column, while the elements of the remaining 
columns of each determinant are the same as in the given determinant. 

For example, 



ai + gi 61 ci 

a 2 + °2 h C 2 



01 


h 


c\ 




a 2 


h 


C'l 


+ 


«3 


h 


C3 





91 h 


c\ 


Q2 h 


c 2 


93 ^3 


C3 



To prove the theorem we have only to expand the three determinants ac- 
cording to the elements of the column in question (the first column in the 
example) and note that the minors are the same for all three determinants. 
Hence a\ + q\ is multiplied by the same minor that a\ and q\ are multiplied 
by separately, and similarly for a 2 + q 2 , etc. 

The similar theorem concerning the splitting of the elements of any row 
into two parts is proved by expanding the three determinants according to the 
elements of the row in question. For example, 



a + r b + s 
c d 



c d c d 
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93. Addition of Columns or Rows. A determinant is not changed in 
value if we add to the elements of any column the products of the corresponding 
elements of another column by the same arbitrary number. 

Let oi, Q2, ■ ■ ■ be the elements to which we add the products of the elements 
&1, 62, • • • by n. We apply §92 with q\ = nb\, q^ = 1162, • • • • Thus the modified 
determinant is equal to the sum of the initial determinant and a determinant 
having b\, 62 > • • ■ i n one column and nb\, n&2, ... in another column. But (§91) 
the latter determinant is equal to the product of n by a determinant with two 
columns alike and hence is zero (§88). For example, 



a 1 


h 


ci 




a 2 


b-2 


c 2 


+ n 


a 3 


h 


C3 





h 


h 


c\\ 


b-i 


b 2 


(■'1 


h 


h 


c;i 



a\ + nb\ b\ c\ 
a 2 + nb 2 b 2 c 2 
a 3 + nb 3 63 c 3 

and the last determinant is zero. 

Similarly, a determinant is not changed in value if we add to the elements 
of any row the products of the corresponding elements of another row by the 
same arbitrary number. 

For example, 

a + nc b + nd 
c d 





a b 




c d 




a b 


= 




+ n 




— 






c d 




c d 




c d 



Example. Evaluate the first determinant below. 



1 


-2 1 




1 1 




1 


2 3 


= 


1 8 3 


= 


6 


4 3 




6 10 3 





10 



8 
10 



-44. 



Solution. First we add to the elements of the second column the products of 
the elements of the last column by 2. In the resulting second determinant, we add 
to the elements of the first column the products of the elements of the third column 
by — 1 . Finally, we expand the resulting third determinant according to the elements 
of its first row. 

EXERCISES 



1. Prove that 



b + c c + a a + b 
9i + c\ c\ + a\ a\ + bi = 2 

h + c 2 c 2 + a 2 a 2 + b 2 



a 


b 


c 


a, 1 


h 


c\ 


a 2 


b 2 


C'l 
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By reducing to a determinant of order 3, etc., prove that 



(a - b)(a - c)(o - d)(b -c)(b- d)(c - d). 



3. 



1 


1 


1 


1 




a 


6 


c 


d 




a 2 


6 2 


c 2 


d 2 




a 3 


b 3 


c 3 


d 3 




2 


-1 


3 


-2 


1 


7 


1 


-1 


3 


5 


-5 


3 


4 


-3 


2 


— 


1 



-42. 



4. 



1 


1 


1 


1 


1 


2 


3 


4 


1 


3 


6 


10 


1 


4 


10 


20 



94. System of n Linear Equations in n Unknowns with D/0. In 
onxi + 012x2 + h ai„x„ = fei, 

a n ixi + a n2 Z2 H 1- a nn x n = k n , 

let D denote the determinant of the coefficients of the n unknowns: 



(12) 



D 



an «12 ••• «ln 



Then 



Dx] 



auxi a 12 •■ 



«lr 



auxi + a±2X 2 + ■ 


T Oln^-n 


0-12 ■ 


■ Uln 


OniXi + a n2 x 2 + • 


T OinnXn 


a n 2 ■ 


Q"nn 



where the second determinant was derived from the first by adding to the 
elements of the first column the products of the corresponding elements of the 
second column by X2, etc., and finally the products of the elements of the last 
column by x n . The elements of the new first column are equal to hi, . . . , k n 
by (12). In this manner, we find that 



(13) 



Dxi = Ki, 



Dx 2 = K 2 , 



U%n — "-rii 



in which Ki is derived from D by substituting ki,...,k n for the elements 
an, ... , a n i of the ith column of D, whence 



Ki 



h an 

kn a n2 



air 



K,, 



« 11 
Vhil 



ain-1 h 
ann—1 fan 
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If D / 0, the unique values of x\, . . . ,x n determined by division from (13) 
actually satisfy equations (12). For instance, the first equation is satisfied since 



k\D - a\\K\ - ai^Ki 



ainK r , 



h 


an 


ai2 • 


■ ain 


h 


an 


an ■ 


■ ain 


k 2 


«21 


«22 • 


• 0,2n 


kn 


«nl 


a n 2 ■ 


ann 



as shown by expansion according to the elements of the first row; and the 
determinant is zero, having two rows alike. 

THEOREM. If D denotes the determinant of the coefficients of the n un- 
knowns in a system of n linear equations, the product of D by any one of 
the unknowns is equal to the determinant obtained from D by substituting the 
known terms in place of the coefficients of that unknown. If D / 0, we obtain 
the unique values of the unknowns by division by D. 

We have therefore given a complete proof of the results stated and illus- 
trated in §80, §81. Another proof is suggested in Ex. 7 below. The theorem 
was discovered by induction in 1750 by G. Cramer. 



EXERCISES 

Solve by determinants the following systems of equations (reducing each deter- 
minant to one having zero as the value of every element but one in a row or column, 
as in the example in §93). 



x + y + z = 11, 
2x — 6y — z = 0, 
3x + Ay + 2z = 0. 



x + y + z = 0, 
x + 2y + 3z = — 1, 
x + 3y + 6z = 0. 



3. x-2y+ z = 12, 

x + 2y + 3z = 48, 

6x + Ay + 3z = 84. 



4. 3x-2y = 7, 
3y — 2z = 6, 
3z-2x = -1. 



x + y + z + w = 1, 
x + 2y + 3z+ Aw = 11, 
x + 3y + 6z + lOw = 26, 
x + Ay + 10z + 20w = 47. 



2x- y + 3,z-2u> = 4, 

x + 7y+ z — w = 2, 
3a; + 5y — 5z + 3w = 0, 
4x — 3y + 2z — u; = 5. 
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7. Prove the first relation (13) by multiplying the members of the hrst equa- 
tion (12) by An, those of the second equation by —A21, ■ ■ ■ , those of the nth equation 
by ( — l) n ~ 1 A n i, and adding, where Aij by denotes the minor of aij in D. Hint: The 
resulting coefficient of #2 is the expansion, according to the elements of its first 
column, of a determinant derived from D by replacing an by ai2, • • • , o-ni by a n 2- 



95. Rank of a Determinant. If we erase from a determinant D of 
order n all but r rows and all but r columns, we obtain a determinant of 
order r called an r-rowed minor of D. In particular, any element is regarded 
as a one-rowed minor, and D itself is regarded as an n-rowed minor. 

If a determinant D of order n is not zero, it is said to be of rank n. If, 
for < r < n, some r-rowed minor of D is not zero, while every (r + l)-rowed 
minor is zero, D is said to be of rank r. It is said to be of rank zero if every 
element is zero. 

For example, a determinant D of order 3 is of rank 3 if D 7^ 0; of rank 2 if D = 0, 
but some two-rowed minor is not zero; of rank 1 if every two-rowed minor is zero, 
but some element is not zero. Again, every three-rowed minor of 



a b 


c d 


e f 


9 h 


a b 


c d 


e f 


9 h 



is zero since two pairs of its rows are alike. Hence it is of rank 2 if some two-rowed 
minor is not zero. But it is of rank 1 if a, b, c, d are not all zero and are proportional 
to e, /, g, h, since all two-rowed minors are then zero. 

96. System of n Linear Equations in n Unknowns with D = 0. 

We shall now discuss the equations (12) for the troublesome case (previously 
ignored) in which the determinant D of the coefficients of the unknowns is 
zero. In view of (13), the given equations are evidently inconsistent if any one 
of the determinants K\, . . . , K n is not zero. But if D and these K 's are all zero, 
our former results (13) give us no information concerning the unknowns xj, 
and we resort to the following 

THEOREM. Let the determinant D of the coefficients of the unknowns 
in equations (12) be of rank r, r < n. If the determinants K obtained from 
the (r + 1) -rowed minors of D by replacing the elements of any column by the 
corresponding known terms k{ are not all zero, the equations are inconsistent. 
But if these determinants K are all zero, the r equations involving the elements 
of a non-vanishing r-rowed minor of D determine uniquely r of the unknowns 
as linear functions of the remaining n — r unknowns, which are independent 
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variables, and the expressions for these r unknowns satisfy also the remaining 
n — r equations. 

Consider for example the three equations (4) in the unknowns x, y, z. Five cases 
arise: 

(a) D of rank 3, i.e., D/0. 

(/?) D of rank 2 (i.e., D = 0, but some two-rowed minor / 0), and 



A'i 



k\ h 


C\ 


&2 &2 


C-2 


&3 &3 


C.3 



^2 



ai 


fci 


c\ 


0-2 


A^2 


C-2 


a 3 


A^3 


C3 



#3 



ai &i 


fci 


a2 &2 


fc 2 


a3 &3 


^3 



Oj 


kj 




h 


k'i 




Ci 


kj 


Oj 


kj 


i 


h 


k :i 


•) 


C J 


kj 



not all zero. 
(7) D of rank 2 and K\, K2, K3 all zero. 
(5) D of rank 1 (i.e., every two- rowed minor = 0, but some element 7^ 0), and 

(i, j chosen from 1, 2, 3) 

not all zero; there are nine such determinants K. 
(e) D of rank 1, and all nine of the two- rowed determinants K zero. 

In case (a) the equations have a single set of solutions (§94). In cases {(5) and (5) 
there is no set of solutions. For (/?) the proof follows from (13). In case (7) one of the 
equations is a linear combination of the other two; for example, if 0162 — ^2^1 / 0, 
the first two equations determine x and y as linear functions of z (as shown by 
transposing the terms in z and solving the resulting equations for x and y), and the 
resulting values of x and y satisfy the third equation identically as to z. Finally, 
in case (e), two of the equations are obtained by multiplying the remaining one by 
constants. 

The reader acquainted with the elements of solid analytic geometry will see that 
the planes represented by the three equations have the following relations: 

(a) The three planes intersect in a single point. 

(/?) Two of the planes intersect in a line parallel to the third plane. 

(7) The three planes intersect in a common line. 

(5) The three planes are parallel and not all coincident. 

(e) The three planes coincide. 
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The remarks preceding our theorem furnish an illustration (the case r = 
n — 1) of the following 

LEMMA 1. If every (r + l)-rowed minor M formed from certain r+1 rows 
of D is zero, the corresponding r + 1 equations (12) are inconsistent provided 
there is a non-vanishing determinant K formed from any M by replacing the 
elements of any column by the corresponding known terms k\ . 

For concreteness, 1 let the rows in question be the first r + 1 and let 



K 



an 



aii 



h 



Tr+ll 



V+lr "V+l 



^0. 



Let d\, . . . , d r +i be the minors of hi, . . . , fc r +l m K. Multiply the first r + 1 
equations (12) by di, -d 2 , ■ ■ ■ ,(-l) r d r+ i, respectively, and add. The right 
member of the resulting equation is the expansion of ±K. The coefficient of x s 
is the expansion of 

an • • • ai r a ls 



a r +ll • • • O-r+lr Or+ls 

and is zero, being an M if s > r, and having two columns identical if s ^ r. 
Hence = ±K. Thus if K ^ 0, the equations are inconsistent. 

LEMMA 2. If all of the determinants M and K in Lemma 1 are zero, but 
an r -rowed minor of an M is not zero, one of the corresponding r + 1 equations 
is a linear combination of the remaining r equations. 

As before let the r + 1 rows in question be the first r + 1. Let the non- 
vanishing r-rowed minor be 



(14) 



d 



r+1 



an 



aii 



a rl 



^0. 



Let the functions obtained by transposing the terms fcj in (12) be 
Li = anxi + ai2X2 H \- a in x n - k{. 



By the multiplication made in the proof of Lemma 1, 

di-Li - d 2 L 2 H h (-l) r d r+1 L r+1 = +K 

Hence L r+ i is a linear combination of L\, . . . ,L r . 



0. 



All other cases may be reduced to this one by rearranging the n equations and relabelling 
the unknowns (replacing X3 by the new x\, for example). 
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The first part of the theorem is true by Lemma 1. The second part is 
readily proved by means of Lemma 2. Let (14) be the non-vanishing r-rowed 
minor of D. For s > r, the sth equation is a linear combination of the first 
r equations, and hence is satisfied by any set of solutions of the latter. In 
the latter transpose the terms involving x r +i, . . . , x n . Since the determinant of 
the coefficients of x\, . . . , x r is not zero, §94 shows that x\, . . . , x r are uniquely 
determined linear functions of x r +i,...,x n (which enter from the new right 
members). 



EXERCISES 

Apply the theorem to the following four systems of equations and check the 
conclusions: 



1. 2x+ y + 3z = 1, 
Ax + 2y - z = -3, 

2x+ y - Az = -A. 



2. 2x+ y + 3z = 1, 
Ax + 2y- z = 3, 
2x+ y - Az = A. 



x — 3y + Az = 1, 
Ax- 12y+16z = 3, 
3x - 9y+ Viz = 3. 



x — 3y + Az = 1, 
Ax - 12y + 16z = A, 
3x - 9y + Viz = 3. 



5. Discuss the system 



ax + y + z = a — 3, 

x + ay + z = —2, 
x + y + az = -2, 



when (i) a = 1; (m) a 
unknowns. 



-2; (m) a/ 1, —2, obtaining the simplest forms of the 



6. Discuss the system 



x + y + z = 1, 

ax + by + oz = fc, 



ax+by+cz 



k z 



when (?) a, o, c are distinct; (m) a = b / c; (m) a 



c. 
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97. Homogeneous Linear Equations. When the known terms k\, 
...,k n in (12) are all zero, the equations are called homogeneous. The de- 
terminants K are now all zero, so that the n homogeneous equations are never 
inconsistent. This is also evident from the fact that they have the set of solu- 
tions x\ — 0, . . . ,x n = 0. By (13), there is no further set of solutions if D / 0. 
If D = 0, there are further sets of solutions. This is shown by the theorem 
of §96 which now takes the following simpler form. 

If the determinant D of the coefficients of n linear homogeneous equations 
in n unknowns is of rank r, r < n, the r equations involving the elements of 
a non-vanishing r-rowed minor of D determine uniquely r of the unknowns 
as linear functions of the remaining n — r unknowns, which are independent 
variables, and the expressions for these r unknowns satisfy also the remaining 
n — r equations. 

The particular case mentioned is the much used theorem: 

A necessary and sufficient condition that n linear homogeneous equations 
in n unknowns shall have a set of solutions, other than the trivial one in which 
each unknown is zero, is that the determinant of the coefficients be zero. 



EXERCISES 



Discuss the following systems of equations: 



1. x + y + 3z = 0, 

x + 2y + 2z = 0, 
x + by — z = 0. 



2. 2x- y+ 4z = 0, 
x + 3y - 2z = 0, 
x- Uy+ Uz = 0. 



x — 2>y + Az = 0, 
Ax- 12y+ lQz = 0, 
3x - 9y+ Viz = 0. 



6x + Ay + 3z - 8Aw = 0, 
x + 2y + 3z- A8w = 0, 
x-2y+ z-12w = ®, 

Ax + Ay- z- 2Aw = 0. 



2x+ 3y - Az+ 5w = 0, 
3a: + 5y — z + 2w = 0, 
7x+lly- 9z + 12w = 0, 
3x+ Ay - llz + 13w = 0. 
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98. System of m Linear Equations in n Unknowns. The case m < n 
may be treated by means of the lemmas in §96. If m > n, we select any n of the 
equations and apply to them the theorems of §§94, 96. If they are found to be 
inconsistent, the entire system is evidently inconsistent. But if the n equations 
are consistent, and if r is the rank of the determinant of their coefficients, we 
obtain r of the unknowns expressed as linear functions of the remaining n — r 
unknowns. Substituting these values of these r unknowns in the remaining 
equations, we obtain a system of m — n linear equations inn-r unknowns. 
Treating this system in the same manner, we ultimately either find that the 
proposed m equations are consistent and obtain the general set of solutions, 
or find that they are inconsistent. To decide in advance whether the former 
or latter of these cases will arise, we have only to find the maximum order r 
of a non-vanishing r-rowed determinant formed from the coefficients of the 
unknowns, taken in the regular order in which they occur in the equations, 
and ascertain whether or not the corresponding (r + l)-rowed determinants K, 
formed as in §96, are all zero. 

The last result may be expressed simply by employing the terminology 
of matrices. The system of coefficients of the unknowns in any set of linear 
equations 

anxi H h a in x n = ki, 

(15) 

arranged as they occur in the equations, is called the matrix of the coefficients, 
and is denoted by 

(a n ai2 ••■ ai n 


By annexing the column composed of the known terms ki we obtain the so- 
called augmented matrix 

fan «12 ••• ai n h 

B = 

\aml ^m2 ' ' ' a m n <^m/ 

The definitions of an r-rowed minor (determinant) of a matrix and of the 
rank of a matrix are entirely analogous to the definitions in §95. 

In view of Lemma 1 in §96, our equations (15) are inconsistent if B is of 
rank r + 1 and A is of rank ^ r. By Lemma 2, if A and B are both of rank r, 
all of our equations are linear combinations of r of them. Noting also that the 
rank r of A cannot exceed the rank of B, since every minor of A is a minor 
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of B, and hence a non-vanishing r-rowed minor of A is a minor of B, so that 
the rank of B is not less than r, we have the following 

THEOREM. A system of m linear equations in n unknowns is consistent 
if and only if the rank of the matrix of the coefficients of the unknowns is equal 
to the rank of the augmented matrix. If the rank of both matrices is r, certain 
r of the equations determine uniquely r of the unknowns as linear functions 
of the remaining n — r unknowns, which are independent variables, and the 
expressions for these r unknowns satisfy also the remaining m — r equations. 

When m — n + 1, B has an m-rowed minor called the determinant of the 
square matrix B. If this determinant is not zero, B is of rank m. Since A has 
no m-rowed minor, its rank is less than m. Hence we obtain the 

COROLLARY. Any system of n + 1 linear equations in n unknowns is 
inconsistent if the determinant of the augmented matrix is not zero. 



EXERCISES 

Discuss the following systems of equations: 

1. 2x+ y + 3z = 1, 2. 2x- y + 3z = 2, 

Ax + 2y — z = — 3, x + ly + z = 1, 

2x + y — 4z = —4, 3x + 5y — 5z = a, 

10x + 5y-6z = -10. 4x-3y + 2z = l. 

3. 4x - y+ z = 5, 4. 4x - 5y = 2, 

2x-3y + 5z = l, 2x + 3y = l2, 

x+ y-2z = 2, 10x-7y = 16. 

5x — z = 2. 



5. Prove the Corollary by multiplying the known terms by x n +i = 1 and apply- 
ing §97 with n replaced by n + 1. 

6. Prove that if the matrix of the coefficients of any system of linear homoge- 
neous equations in n unknowns is of rank r, the values of certain n—r of the unknowns 
may be assigned at pleasure and the others will then be uniquely determined and 
satisfy all of the equations. 
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99. Complementary Minors. The determinant 



(16) 



D 



01 


&1 


c\ 


dl 


02 


&2 


C'2 


d 2 


3 


&3 


c;i 


d 3 


a 4 


b A 


c 4 


G?4 



is said to have the two-rowed complementary minors 



M 



o-i h 


Af' = 


C2 d 2 


03 & 3 




C4 04 



since either is obtained by erasing from D all the rows and columns having an 
element which occurs in the other. 

In general, if we erase from a determinant D of order n all but r rows and 
all but r columns, we obtain a determinant M of order r called an r-rowed 
minor of D. But if we had erased from D the r rows and r columns previously 
kept, we would have obtained an (n — r)-rowed minor of D called the minor 
complementary to M. In particular, any element is regarded as a one-rowed 
minor and is complementary to its minor (of order n — l). 

100. Laplace's Development by Columns. Any determinant D is 
equal to the sum of all the signed products ±MM' , where M is an r-rowed 
minor having its elements in the first r columns of D, and M' is the minor 
complementary to M , while the sign is + or — according as an even or odd 
number of interchanges of rows of D will bring M into the position occupied by 
the minor M\ whose elements lie in the first r rows and first r columns of D. 

For r = 1 , this development becomes the known expansion of D according to the 
elements of the first column (§90); here M\ = e\\. 

If r = 2 and D is the determinant (16), 



D 



+ 



01 61 




C3 ^3 




I01 h 




C2 d2 




a\ 61 




C2 G?2 


a 2 6 2 




C4 di 


[03 h\ |c4 At " r 


04 64 


C3 G?3 


02 62 




C\ d\\ (22 62 




a d\ 


+ 


03 &3 


_ \c\ d\ 


«3 ^3 




C4 C?4 




04 64 




C3 d 3 


04 64 


' 


C2 G?2 



The first product in the development is M\M[; the second product is —MM' (in 
the notations of §99), and the sign is minus since the interchange of the second and 
third rows of D brings this M into the position of M\. The sign of the third product 
in the development is plus since two interchanges of rows of D bring the first factor 
into the position of M\. 
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If D is the determinant (8), then 



Mi 



en 



e r i 



ei, 



M{ 



e r+lr+l 



&nr+l 



'r+ln 



Any term of the product MiM{ is of the type 



(17) 



(-l)*ej 1 iej 2 2 • • • e irr • (-l) J e ir+ir+ i 



• e 



inii 



where i\,...,i r is an arrangement of 1, . . . , r derived from 1, . . . , r by i inter- 
changes, while i r +i, ■ ■ ■ ,in is an arrangement of r + 1, . . . , n derived by j in- 
terchanges. Hence i\, . . . ,i n is an arrangement of 1, . . . , n derived by i + j 
interchanges, so that the product (17) is a term of D with the proper sign. 

It now follows from §87 that any term of any of the products ±MM' men- 
tioned in the theorem is a term of D. Clearly we do not obtain twice in this 
manner the same term of D. 

Conversely, any term t of D occurs in one of the products ±MM'. Indeed, 
t contains as factors r elements from the first r columns of D, no two being in 
the same row, and the product of these is, except perhaps as to sign, a term of 
some minor M. Thus t is a term of MM' or of —MM'. In view of the earlier 
discussion, the sign of t is that of the corresponding term in ±MM', where the 
latter sign is given by the theorem. 

101. Laplace's Development by Rows. There is a Laplace develop- 
ment of D in which the r-rowed minors M have their elements in the first 
r rows of D, instead of in the first r columns as in §100. To prove this, we 
have only to apply §100 to the equal determinant obtained by interchanging 
the rows and columns of D. 

There are more general (but less used) Laplace developments in which the 
r-rowed minors M have their elements in any chosen r columns (or rows) of D. 
It is simpler to apply the earlier developments to the determinant ±D having 
the elements of the chosen r columns (or rows) in the new first r columns (or 
rows). 

EXERCISES 



1. Prove that 



a 


b c 


d 












e 


f 9 


h 




a b 




j 


k 





j 


k 




e f 




I 


rn 





/ 


m 













5102.1 



PRODUCT OF DETERMINANTS 



139 



2. By employing 2-rowed minors from the first two rows, show that 



abed 


























e f g h 
abed 




a b 
e / 




c d 
q h 




a c 
e g 




b d 
f h 

J 


+ 


a d 
e h 




b c 
f 9 


e f g h 



























0. 



3. By employing 2-rowed minors from the first two columns of the 4-rowed 
determinant in Ex. 2, show that the products in Laplace's development cancel. 



a b 




e f 




ae + bg af + bh 


c d 




9 h 




ce + dg cf + dh 



102. Product of Determinants. The product of two determinants of 
the same order is equal to a determinant of like order in which the element 
of the rth row and cth column is the sum of the products of the elements of 
the rth row of the first determinant by the corresponding elements of the cth 
column of the second determinant. 

For example, 

(18) 

While for brevity we shall give the proof for determinants of order 3, the 
method is seen to apply to determinants of any order. By Laplace's develop- 
ment with r = 3 (§101), we have 




91 

92 
9?> 



(19) 



oi 
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c\ 








0, 2 


b 2 


(■■l 








«3 
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C3 








-1 








ei 


h 





-1 





e 2 


h 








-1 


e3 


h 



01 


h 


c\ 




ei h 


0, 2 


b 2 


C'2 




&2 h 


0-3 


h 


C-A 




e'S h 



91 
92 
93 



In the determinant of order 6, add to the elements of the fourth, fifth, 
and sixth columns the products of the elements of the first column by e\, f\, 
31, respectively (and hence introduce zeros in place of the former elements 
e l ? fli g\ )• Next, add to the elements of the fourth, fifth, and sixth columns 
the products of the elements of the second column by e2, fii 52 5 respectively. 
Finally, add to the elements of the fourth, fifth, and sixth columns the products 
of the elements of the third column by e%, fy, g%, respectively. The new 
determinant is 

oiei + 6ie 2 + cie 3 01/1 + 61/2 + 01/3 oiffi + ^152 + C133 

a 2 ei + 6 2 e 2 + c 2 e 3 o 2 /i + 62/2 + C2/3 a 2 gi + 6252 + C253 

a 3 e! + 6 3 e 2 + c 3 e 3 03 /1 + 6 3 / 2 + c 3 / 3 a :i g 1 + b :i g 2 + c 3 3 3 









04 


h 


c\ 


0, 2 


b 2 


c 2 


03 


h 


C3 


-1 











-1 











-1 
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By Laplace's development (or by expansion according to the elements of the 
last row, etc.), this is equal to the 3-rowed minor whose elements are the long 
sums. Hence this minor is equal to the product in the right member of (19). 

EXERCISES 



1. Prove (18) by means of §92. 

2. Prove that, if s, t = a 1 + (3 i + 7', 



1 


1 


1 




1 a a 




3 


si s 2 


O: 


P 


7 




1 P P 2 


= 


«i 


S2 S 3 


■y 2 


B 2 


7 2 




1 7 7 2 




S2 


S 3 S 4 



3. If Ai, Bi, Ci are the minors of a», bi, Cj in the determinant D defined by the 
second factor below, prove that 



Ai 


-A 2 


A3 




ai 61 c\ 




D 








Si 


S 2 


-S3 




a 2 ^2 c 2 


= 





D 





Ci 


-c 2 


c 3 




<33 ^3 C 3 










19 



Hence the first factor is equal to D 2 if D 7^ 0. 

4. Express (a 2 + 6 2 + c 2 + d 2 )(e 2 + f 2 + g 2 + h 2 ) as a sum of four squares by 
writing 

a + bi c + di e + fi g + hi 
-c+ di a — bi —g + hi e — fi 

as a determinant of order 2 similar to each factor. Hint: If k' denotes the conjugate 
of the complex number k, each of the three determinants is of the form 

k I 
-V k! 



MISCELLANEOUS EXERCISES 

1. Solve 

ax + by + cz = k, 

ax+by+cz=k, 

4 1 l4 1 4 7 4 

by determinants for x, treating all cases. 

2. In three linear homogeneous equations in four unknowns, prove that the 
values of the unknowns are proportional to four determinants of order 3 formed 
from the coefficients. 



Factor the following determinants: 
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1 


a 


be 


1 


b 


ca 


1 


c 


ab 



X 


X 2 


yz 




y 


y 2 


xz 


= 


z 


z 2 


xy 





5. 

a b c 

c a b — (a + b + c)(a + boo + clo )(a + bto + cut), 
b c a 

where u is an imaginary cube root of unity. 
6. 

8. If the points (x\, yi), . . . , (3:4, y 4 ) lie on a circle, prove that 



a 


b 


c 


d 


b 


a 


d 


c 


c 


d 


a 


b 


d 


c 


b 


a 



a 


b 


c 


d 


d 


a 


b 


c 


c 


d 


a 


b 


b 


c 


d 


a 



x\ + y\ xi yi 1 



x\ + y\ x 4 y 4 1 



0. 



9. Prove that 







aa! + W + cc' 


eaf + fbf + 


gd 










ae> + bf + eg' ee> + ff + gg> 




a b 
e f 




a' b' 

e' r 


+ 


a c 
e g 




a' d 

e' g' 


+ 


b c 
f 9 




b' d 

f g' 





10. Prove that the cubic equation 

a — x 
D(x) = b 
c 



b 
f - x 



c 

9 






has only real roots. Hints: 



X 3 


1 


-</ 


1 


z i 


1 



D(x)-D(-x) 



a 2 + b 2 + c 2 - x 2 
ab + bf + eg 
ac+bg + ch 



ab+bf + eg 
b 2 + f 2 + g 2 - 
bc + fg + gh 



ac+bg + ch 
bc + fg + gh 
c 2 + g 2 + h 2 - x 2 



= -x 6 + x 4 {a 2 + f 2 + h 2 + 2b 2 + 2c 2 + 2g 2 ) - x 2 (D 1 + D 2 + D 3 ) + D 2 (0), 

where D3 denotes the first determinant in Ex. 9 with all accents removed and with 
e = b, while D\ and D2 are analogous minors of elements in the main diagonal of 
the present determinant of order 3 with x — 0. Hence the coefficient of —x 2 is a 
sum of squares. Since the function of degree 6 is not zero for a negative value of x 2 , 
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D{x) = has no purely imaginary root. If it had an imaginary root r + si, then 
D{x + r) = would have a purely imaginary root si. But D(x + r) is of the form 
D{x) with a, f, h replaced by a — r, f — r, h — r. Hence D{x) = has only real 
roots. The method is applicable to such determinants of order n. 

11. If a±, . . . , a n are distinct, solve the system of equations 

+ Z - + --- + Z ~ = 1 (» = l,...,n). 



hi — a\ hi — 0,2 

Hint: Regard ki,...,k n as the roots of an equation of degree n in k formed from the 
typical one above by substituting k for k{ and clearing of fractions; write k = cij — t, 
and consider the product of the roots of t n + • • • = 0. Hence find Xj. 



12. Solve the equation 



a + x x x 

x b + x x 

\ X X C + X 



CHAPTER IX 

Symmetric Functions 

103. Sigma Functions, Elementary Symmetric Functions. A ratio- 
nal function of the independent variables xi,X2, ■ ■ ■ ,x n is said to be symmetric 
in them if it is unaltered by the interchange of any two of the variables. For 
example, 

9 9 9 

Xj + x 2 + x 3 + 4xi + 4x2 + 4x3 

is a symmetric polynomial in x\, X2, X3; the sum of the first three terms is 
denoted by Ex 2 and the sum of the last three by 4£xi. In general, if t is a 
rational function of x\, . . . ,x n , Si denotes the sum of t and all of the distinct 
functions obtained from t by permutations of the variables; such a S-function 
(read sigma function) is symmetric in x\, . . . , x n . 

For example, if there are three independent variables a, /3, 7, 

Ea/3 = a/3 + 07 + £7, £a 2 /3 = a 2 /3 + a/3 2 + a 2 7 + a7 2 + /3 2 7 + /?7 2 , 

1111 fl /? a /3 7 a 7 

aap7 aa/?7P7a 

a 2 + /3 2 _ a 2 + /3 2 a 2 + 7 2 /3 2 + 7 2 
a/3 a/3 07 /?7 

In particular, Sa = a + /3 + 7, Sa/3, and a/?7 are called the three elementary 
symmetric functions of a, /?, 7. In general, 

Sai, Saia2, Saia2a3, . . . , Eaia2 • • -a n _i, aia2---a ri 

are the elementary symmetric functions of a\, cc2,---,<x n - I n §20 they were 
written out more fully and proved to be equal to — c\, C2, — C3, . . . , (— l) n c n if 
a\, . . . , a n are the roots of the equation 

(1) x n + Cl x n - X + c 2 x n ~ 2 + • • • + c n = 

whose leading coefficient is unity. 
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EXERCISES 

If a, 3, 7 are the roots of x 3 + px 2 + qx + r = 0, so that Ea = —p, T,a3 = q, 
and ad'j = — r, prove that 

1. (Sa) 2 = Sa 2 + 2Ea/3, whence Ea 2 = p 2 - 2q. 

2. Sa • Ea/3 = Ea 2 /3 + 3a/?7, whence Sa 2 /3 = 3r — pq. 

3. Ea 2 /?7 = pr. 

4. Ea 2 /3 2 = (Ea/3) 2 - 2a/?7Ea = q 2 - 2pr. 

If a, 3, 7, 5 are the roots of x 4 + px 3 + qx 2 + rx + s = 0, prove that 

1 -r 1 g 1 r 2 -2qs 

5. E-= — , £—- = -, E^ = ^ ■ 

a s ap s a z s z 

Hint: Compute the sum, sum of the products two at a time, and sum of the squares 
of the roots of the equation 

1 + py + qy 2 + ry 3 + sy 4 = 0, 

obtained by replacing x by 1/y in the given quartic equation. 

3 1 pr 
o. E — = La • E 4 = 4. 

a as 

n ^a 2 + 3 2 3 



E — = -^{qr 2 -2q 2 s - prs + 4s 2 ). 
a s z 





a/3 




a 


8. 




Ea/3 


•e1 

a- 


9. 


E^ = 
a/3 


3r — 

s 


p<2 



104. Fundamental Theorem on Symmetric Functions. Any poly- 
nomial symmetric in x\,...,x n is equal to an integral rational function, with 
integral coefficients, of the elementary symmetric functions 

(2) E\ = Exi, E2 = T,x\X2, E$ = Y,xiX2X3, . . . , E n = x\xi ■ ■ ■ x n 

and the coefficients of the given polynomial. In particular, any symmetric 
polynomial with integral coefficients is equal to a polynomial in the elementary 
symmetric functions with integral coefficients. 
For example, if n = 2, 

rxi + rx 2 + sx\ + sx 2 = r(E 1 — 2E 2 ) + sE\. 
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In case r and s are integers, the resulting polynomial in E\ and Ei has integral 
coefficients. 

The theorem is most frequently used in the equivalent form: 

Any polynomial symmetric in the roots of an equation, 

x n - E lX n - 1 + E 2 x n ~ 2 -■■■ + {-l) n E n = 0, 

is equal to an integral rational function, with integral coefficients, of the coef- 
ficients of the equation and the coefficients of the polynomial. 

It is this precise theorem that is required in all parts of modern algebra 
and the theory of numbers, where attention to the nature of the coefficients 
is vital, rather than the inadequate, oft-quoted, theorem that any symmetric 
function of the roots is expressible (rationally) in terms of the coefficients. 

It suffices to prove the theorem for any homogeneous symmetric polyno- 
mial S, i.e., one expressible as a sum of terms 

It — iAiJb -1 Jucy Ju y-. 

of constant total degree k = k\ + k 2 + • • • + k n in the x's. Evidently we may 
assume that no two terms of S have the same set of exponents k\, . . . , k n (since 
such terms may be combined into a single one). We shall say that h is higher 
than the term bx^x 2 2 • ■ ■ xlfi if k\ > h, or if k\ — h, k 2 > hi or if fa = hi fa = hi 
fa > 1%, . . . , so that the first one of the differences k\ — h, k 2 — hi fa — hi--- 
which is not zero is positive. 

We first prove that, if the above term h is the highest term of S, then 

fa = fa ^ fa'" ^ fa- 

For, if fa < fa, the symmetric polynomial S would contain the term 

ki fcl fca kn 

yjjtXj -1 Ju c\ Ju q tj 1 

which is higher than h. If fa < fa, S would contain the term 

fci k% ko kn 

Hi JU I Ju CI Ju O JLtiy\ * 

which is higher than h, etc. 

If the highest term in another homogeneous symmetric polynomial S' is 

U' — n ' fc l™ fc 2 . . . k' n 

IV — Uj Ju -1 Jucy ^"fl 5 

and that of S is h, then the highest term in their product SS' is 

hti = aa'x k 1 1+k ' 1 ---x>!r +k ". 
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Indeed, suppose that SS' has a term, higher than hh' , 



(3) cx 1 ^ 1 ' 1 ■ ■ ■ - ln+l ' n 






which is either a product of terms 



t = bx-t ■ ■ ■ x l £ , t' = b'x-l 



A J 



a 



of S and S' respectively, or is a sum of such products. Since (3) is higher 
than hh', the first one of the differences 

'l T I \ ~ K\ — k\, ■ ■ ■ j in + <"a ~ ™n ~ k n 



n 



which is not zero is positive. But, either all of the differences l\ — k\, . . . , l n — h 
are zero or the first one which is not zero is negative, since h is either identical 
with t or is higher than t. Likewise for the differences l^ — k[, . . . , l' n — k' n . We 
therefore have a contradiction. 

It follows at once that the highest term in a product of any number of 
homogeneous symmetric polynomials is the product of their highest terms. 
Now the highest terms in E\, E^-, E$, . . . , E n , given by (2), are 

xi, £122, ^1^2a;3, ■■■, x\X2---x n , 

respectively. Hence the highest term in E^E^ 2 ■ ■ ■ E® n is 

ai+a 2 H ha„ a 2 H \-a n _ _ ^ a n 

•Xj 1 O/i) Jb, 



II 



Thus the highest term in 

a = aE^- k2 E^- k3 ■ ■ ■ E k ^ kn E k n n 

is h. Hence Si = S — a is a homogeneous symmetric polynomial of the same 
total degree k as S and having a highest term hi not as high as h. As before, we 
form a product <ri of the £"s whose highest term is this hi. Then S2 = Si — 01 
is a homogeneous symmetric polynomial of total degree k and with a highest 
term hi not as high as h\. We must finally reach a difference St — at which is 
identically zero. Indeed, there is only a finite number of products of powers of 
xi, . . . ,x n of total degree k. Among these are the parts h', h\, h' 2 , ■ ■ ■ of h, hi, 
/12,... with the coefficients suppressed. Since each hi is not as high as hi-i, 
the h! , h[, h' 2 , . . . are all distinct. Hence there is only a finite number of h^. 
Since St — at = 0, 

S = a + Si = a + ai + S2 = ■ ■ ■ = a + ai + a 2 + ■ ■ ■ + at. 
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Hence 5 is a polynomial in E\, E2, . . . , E n and a, b, ... , with integral coefficients. 
Example 1. If S = Y,x\x\x?, and n > 4, we have 

a = E2E3 = S + 3Sx 1 X2X3X4 + IOEX1X2X3X4X5, 
Si = S — a = — 3Ex 1 X2X3X4 — IOSX1X2X3X4X5, 
G\ = SE1E4 = —?>{Y;x\x 2X3X4 + 5SX1X2X3X4X5), 
^2 = Si — <J\ — 5EX1X2X3X4X5 = 5.E5, 

5 = a + 5i = (7 + ^ + 52 = ^2^3 - ZE X E A + 5£ 5 - 
Example 2. If 5 = Sxfx2^3 an d n > 4, 

(T = E^E^ = E , i(Sx^2^3 + 4ExiX2^3^4) 

= Sx x X2X3 + 2Sx 1 x 2 X3 + 3Ex 1 X2X3a;4 

+ 4(Sx 1 X2a;3X4 + 5^x1X2X3X4X5), 

S\ = S — a = — 2Ex 1 x 2 X3 — 7^x^x2X3X4 — 2OSX1X2X3X4X5. 
Take <j\ = —2E2E3 and proceed as in Ex. 1. 

Example 3. By examples 1 and 2, if n > 4, 

aY>xXx\x-i + bZxlx 2 x 3 = bE\ E 3 - (3a + b)E x E A + (a - 2b)E 2 E 3 + 5(o + &)£? 5 . 



105. Rational Functions Symmetric in all but One of the Roots. 

If P is a rational function of the roots of an equation f(x) = of degree n and if 
P is symmetric inn — 1 of the roots, then P is equal to a rational function, with 
integral coefficients, of the remaining root and the coefficients of f(x) and P. 
For example, P = rai + a| + a i + ' ' ' + a n 1S symmetric in 02, ■ • • , a n , and 

P = rai + £«! — cti = c 1 — 2c2 + rai — a 1; 

if «i, . . . ,a n are the roots of equation (1). 

Since 1 any symmetric rational function is the quotient of two symmetric 
polynomials, the above theorem will follow if proved for the case in which the 
words rational function are in both places replaced by polynomial. 



1 If N/D is symmetric in <x\, 0.1, and the polynomials N and D have no common factor, 
while N becomes N' and D becomes D' when ai, a-i are interchanged, then ND' = DN' . 
Thus N divides N' and both are of the same degree. Hence N' = cN, D' = cD, where c is 
a constant. By again interchanging ai, 02, we obtain N from N' , whence N = cN 1 = c 2 N, 
c 2 = 1. If c = — 1, we take a.\ = a^ and see that N = N' = —N, N = 0, whence N has the 
factor «i — Q!2- Similarly, D has the same factor, contrary to hypothesis. Hence c = +1 and 
N and D are each symmetric in ai, Qli- 
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If a\ is the remaining root, the polynomial P is symmetric in the roots 
«2, ■ ■ ■ ,a n °f f( x )/( x — a\) — 0, an equation of degree n — 1 whose coefficients 
are polynomials in ct\, c\,...,c n with integral coefficients. Hence (§104), P 
is equal to a polynomial, with integral coefficients, in a\, ci,...,c n and the 
coefficients of P. 

Example. If a, j3, 7 are the roots of f(x) = x 3 + px 2 + qx + r = 0, find 

« 2 + / 3 2 _a 2 + / 3 2 « 2 + 7 2 /3 2 + 7 2 

ij 7T~ — ~7, r r — ~, • 

a + p a + p a+7 P+7 

Solution. Since (3 2 + 7 2 = p 2 — 2q — a 2 , (5 + 7 = — p — a, 

^ 2 + /? 2 ^ P 2 -2q-a 2 ( 2q \ 1 

£ — = £ = £ a — p H = — p — 3p + 2g£- 



a + (3 —p — a \ a + p J 'a+p 

But a + p, (3 + p, 7 + p are the roots yi, 1/2, 2/3 of the cubic equation obtained from 
f(x) = by the substitution x + p = y, i.e., x = y — p. The resulting equation is 

y 3 - 2py 2 + (p 2 + q)y + r - pq = 0. 

Since we desire the sum of the reciprocals of y\, 7/2 , 1J3, we se t y = 1/z and find the 
sum of the roots z\, Z2, £3 of 

1 — 2pz + (p 2 + q)z 2 + (r — pq)z 3 = 0. 

Hence 

^ 1 ^ 1 ^ p 2 + q ^ a 2 + (3 2 2q 2 - 2p 2 q + Apr 

£ — — = £ — = Ezi = , £ —5- = • 

a+p V\ pq — r a + p pq — r 



EXERCISES 

[In Exs. 1-12, a, (3, 7 are the roots of f(x) = x 3 + px 2 + qx + r = 0.] 
Using /?7 + a((3 + 7) = g, find 

1. s ^7 + « 2 2 ^7 -2a 2 

/3 + 7 ' /3 + 7 - a ' 

3. Why would the use of /?7 = —r/a complicate Exs. 1, 2? Verify that 

-r /(a) - r 2 

P7 = — = = a + pa + q. 

a a 

/3 2 + 7 2 

4. Why would you use (3j = —r/a in finding £ — ? 

P7 + c 
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,3 „ r^-^iP-l 



\ 2 

5. Find S(/5 + 7 ) 2 . 6. Find X(a + /3-7) 3 . 7. Find £ ' ' '. . 

V/9 + 77 

8. Find a necessary and sufficient condition on the coefficients that the roots, in 

112 — 3r 
some order, shall be in harmonic progression. Hint: If — | — = — , then /3 = 0, 

a 7 p q 

and conversely. Hence the condition is 

9. Find the cubic equation with the roots (3y , ay — — , a(3 . Hint: since 

a (3 7 

these are (— r — l)/a, etc., make the substitution (— r — l)/x = y. 

Find the substitution which replaces the given cubic equation by one with the 
roots 

10. a(3 + cry, a/3 + fjy, cry + /3y. 

11- 73- . etc - 12. -P , etc. 

P + 7-a /3 + 7-2a 

If a, /3, 7, 8 are the roots of x 4 + px 3 + c^ 2 + rx + s = 0, hnd 

!- ^ 2 + 7 2 + * 2 14 ^-y + 0S + ll s 

/3 + 7 + 5 ' ' /? + 7 + <*-3' 

15. Prove that if j/i, y 2 , ^3 are the roots of y 3 + py + q = 0, the equation with 
the roots z 1 = (y 2 - y 3 ) 2 , z 2 = (yy - y 3 ) 2 , ^3 = (yi ~ yi) 1 is 

z 3 + 6pz 2 + 9p 2 z + 4p 3 + 27g 2 = 0. 

Hints: since z\ = Tjy 2 — 2y 2 y^ — y 2 = —2p+2q/yy — y 2 , etc., we set z = —2p+2q/y — y 2 . 
By the given equation, y 2 + p + g/y = 0. Thus the desired substitution is z = 
-p + 3q/y, y = 3q/(z + p). 

16. Hence hnd the discriminant of the reduced cubic equation. 

17. If xy, ■ ■ ■ , x n are the roots of f(x) = 0, show that 

s 1 _-f(c) m 
X-y-C /(c) 

Hint: xy — c — yy, . . . , x n — c — y n are the roots of 

f(c + y) = f(c)+yf>(c) + y 2 ( ) + ••• = 0, 
as shown by Taylor's theorem. Or we may employ (5) below for x = c. 
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106. Sums of Like Powers of the Roots. If a\, ... ,a n are the roots 
of 

(1) f(x) = x n + Cl x n - 1 + c 2 x n ~ 2 + ■ ■ ■ + Cn = 0, 

we write si for Sai, s 2 for £a 2 , and, in general, 

s fc = £af = af + a\ -\ V a\. 

The factored form of (1) is 

(4) f(x) = (x-a\){x-a2) ■■■(x-a n ). 

The derivative f'(x) of this product is found by multiplying the derivative 
(unity) of each factor by the product of the remaining factors and adding the 
results. Hence 

f'(x) = (x - a 2 ) • • • (x - a n ) + (x - a.\){x - a 3 ) • • • (x - a n ) H , 

(5) ,-w.JW +JW- + ...+ /w 



x — ot\ x — a 2 x — a n 

If a is any root of (1), f(a) = and 

fix) fix) — fia) x n — a n x 71-1 — a n ~ x — a 

+ C\ + ••• + %_!- 



x — a x — a x — a x — a x — a 

= x n ~ l + ax n ~ 2 + a 2 x n ~ 3 H h ci(x™~ 2 + ax™" 3 H ) 

+ C2 (^- 3 + . ••) + •••, 

/( x ) _ „n-l ,/„,,„ \„n-2 , / ,2 , , „ x^n-3 



(6) X - a 



x 71 1 + (a + c^x 71 z + (cT + cia + c 2 )x ri ^ H 

+ (a fc + cia^" 1 + qc^" 2 + • • • + c fc _ia + c k )x n ~ k ~ X + 



Taking a to be a\,...,a n in turn, adding the results, and applying (5), we 
obtain 

/'(&) = ra"" 1 + (si + nci)x n " 2 + (s 2 + cisi + nc 2 )x n ~ 3 + ■■■ 

+ (sk + c l s k-l + c 2 s k _ 2 + ■■■ + c fc _xsi + nc fc )x ri " fc " 1 + • • • . 

The derivative of (1) is found at once by the rules of calculus (or by §56) 
to be 

f'(x) = nx 71 ' 1 + (n - l)cix n - 2 + in - 2)c 2 x n " 3 + • • • + (n - k)c k x n ~ k - 1 + ■■■ . 

Since this expression is identical term by term with the preceding, we have 

si + ci = 0, s 2 + cisi + 2c 2 = 0, . . . , 

s fc + cisfc.x + c 2 s fc _ 2 H hcfc.xsi + fccfe = (fc^n-1). 
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We may therefore find in turn s\, s 2 , ■ ■ ■ , s n -\\ 

(8) s\ = -ci, S2 = cj-2c 2 , s 3 = -c\ + 3c!C 2 - 3c 3 ,... . 

To find Sn, replace x in (1) by a\,...,a n in turn and add the resulting 
equations. We get 

(9) s n + cis n _i + c 2 s n _ 2 H 1- c n -isi + ™c n = 0. 

We may combine (7) and (9) into 

(10) s k + cis k _ 1 + c 2 s k _ 2 ^ h Ck-isi + kc k = (A; = l,2,...,n). 

This set of formulas (10) will be referred to as Newton's identities. The 
student should be able to write them down from memory and, when writing 
them, should always check the final one (9) by deriving it as above. 

To derive a formula which shall enable us to compute the s k for k > n, we 
multiply (1) by x k ~ n , take x = a\, . . . ,x = a n in turn, and add the resulting 
equations. We get 

(11) s k + c\s k _i + c 2 s k _ 2 H ^c n s k _ n = (k>n). 

Instead of memorizing this formula, it is preferable to deduce it for the 
particular equation for which it is needed, thus avoiding errors of substitution 
as well as confusion with (10). 

Example. Find s fc for x n — 1 = 0. 

Solution. Comparing our equation with (1), we have c\ = 0, ...,c n _i = 0, 
c„ = — 1. Hence in (10) for k < n, each c is zero and Sk = 0. But, for k = n, (10) 
becomes s n — n = 0. We may check the latter by substituting each root a±, . . . , a n in 
our given equation and adding. Finally, to find si when / > n, multiply our equation 
by x l ~ n . In the resulting equation x l — x l ~ n = we substitute each root, add, and 
obtain si — s/_ n = 0. Hence from s; we obtain an equal s by subtracting n from I. 
After repeated subtractions, we reach a value k for which 1 ^ k ^ n. Since s k = 
or n according as k < n or k = n, it follows that si = or n according as / is not or 
is divisible by n. 

EXERCISES 

1. For a cubic equation, S4 = c\ — 4c^C2 + 4ciC3 + 2c|. 

2. For an equation of degree ^ 4, S4 = c\ — 4cf C2 + 4cic t 3 + 2c^ — 4c4. 

3. Find S2, S3, S4, S5 for x 2 — px + q = 0. 

4. Find Sfc for x 5 - 3 = 0. 

5. Find 82, S3, sq, sj for x 5 — px + q = 0. 
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107. Waring's Formula for s^ in Terms of the Coefficients. While 
we have learned how to find si, S2, S3,... in turn by Newton's identities, 
it is occasionally useful to know an explicit expression for s^, where k has 
an arbitrary value. The formula in question is applied ordinarily only to a 
quadratic equation 

x + px + q = 0. 

Accordingly we shall treat this case in detail. If its roots are a and f3, then 

x + px + q = (x — a)(x — (3). 

Replace x by 1/y and multiply by y 2 . We get 

(12) l+py + qy 2 = (l-ay)(l-py). 

Taking derivatives, we have 

p + 2qy = -ail - 0y) - /3(1 - ay). 

Change of signs and division by the members of (12) gives 

/ 1q n ~P - 2qy a__ . P 

\ i6 ) 1 , „„. , „„.2 - 1 „.„. + 



1 + py + qy 2 1 - ay 1 — Py 

The identity in Ex. 7, §14, with n changed to k, may be written in the form 



1 • „2 , , Jfc-1 r 



k 



(14) = \ + r + r A + ---+r K ~ l + 

1 — r 1 — r 

Take r = ay and multiply the resulting terms by a; thus 

a A- A--] a k+1 y k 

- a + a z y + ■ ■ ■ + a k y k v + 



1 — ay ' ' 1 — ay 

Similarly, 

P a , ,2.. , , akk-i , P k+1 y k 



/3 + /^ + --. + /?V c - 1 + 



1 - Py 1-Py 

To show that on adding, and writing s^ for a k + f3 k , we obtain (15), we need 
the sum of the final fractions, which by (12) is 

^ " ^ <t> - « fc+1 d - Py) + P k+1 d - ay). 



(1 -ay) (l-Py) 1 + py + qy 2 

Hence 

/,r.\ a P u_i (py k 

(15) + T J ^ r = si + s 2 y + --- + s k y k L + 



1 - ay 1 - py 1 + py + qy z 
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where the exact expression for <f> is immaterial. 

Next, we seek an expansion of the fraction in the left member of (13). Its 
denominator will be identical with that in (14) if we choose r = —py — qy 1 . 
Evidently (14) may be written in the compact form 

., k— 1 u 

1 sr^ t r 

1 — r z — ' 1 — r 

t=0 



Hence it becomes 



k—l , u 

— ipy K 



l+py + qy z f^i l+py + qy A 



t=o 

where ip = (—p — qy) k , although no use will be made of the particular form of 
the polynomial ip. By the binomial theorem, 

where the summation extends over all sets of integers g and h, each ^ 0, for 
which g + h = t, while g\ denotes the product of 1, 2, . . . , g if g ^ 1, but denotes 
unity if g = 0. Hence 

(16) ~ P ~l q \ = (p + 2^)£(-1)^ +1 ^^pV^ +2/i + £, 

1 + py + qy A ^— ' ^In! 

(-p - 2qy)i\)y k 

1 + py + <?y z 

where the summation extends over all sets of integers g and h, each ^ 0, for 
which g + h S k — 1. 

Since the left members of (15) and (16) are identically equal by (13), their 
right members must be identical, so that the coefficients of y k ~ 1 in them must 
be equal. 2 Hence the coefficient s^ of y k ~ 1 in (15) is equal to the coefficient 
of y k ~ l in (16), which is made up of two parts, corresponding to the two terms 
of the factor p+2qy. When we use the constant term p, we must employ from Yl 
in (16) the terms in which the exponent of y is equal to k — 1. But when we use 
the other term 2qy, we must employ from Yl the terms in which the exponent 
of y is equal to k — 2, in order to obtain the combined exponent k — 1 of y. 



2 In fact, the (fc — l)th derivatives of the two right members are identical, and we obtain 
the indicated result by substituting y = in these two derivatives and equating the results. 
Note that the final terms in both (15) and (16) have y as a factor of their (k— l)th derivatives. 
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Hence s^ is equal to the sum of the following two parts: 

p £ ( _l)*+fc+i ^±£W 0/ + 2/i = fc - 1), 

2g £(-l)* +ft+1 ^j^V (5 + 2* = k - 2). 

In the upper sum, write i for g+ 1, and j for /i. In the lower sum, write i for g, 
and j for h + 1. Hence 

where now each summation extends over all sets of integers i and j, each ^ 0, 
for which 

(17) i + 2j = k. 

Finally, we may combine our two sums. Multiply the numerator and de- 
nominator of the first fraction by i, and those of the second fraction by j. 
Thus 

(is) 8fc = fc ^ ( -i)i+i (i±J^ p y, 

since the present fraction occurred first multiplied by i and second multiplied 
by 2j, and, by (17), the sum of these multipliers is equal to k. Our final result 
is (18), where the summation extends over all sets of integers i and j, each ^ 0, 
satisfying (17). 

If we replace i by its value k — 2j, and change the sign of p, we obtain from (18) 
the result that the sum of the kth powers of the roots of x 2 — px + q = is equal to 



(19) * = '£(-l)'|^yf/- 2 V 

= / - k P «-\ + m^-v - fc( *: 4 ^ (t 3 ~ 5 V -v + • • • , 

where K is the largest integer not exceeding k/2. 

The product of the roots is equal to q. Hence if x denotes one root, the second 
root is q/x. Thus sj~ = x k + (q/x) k . Again, the sum of the roots is x + q/x = p. 
Regard q as given and p as unknown. Hence, if c is an arbitrary constant, the 
equation 

(20) p k - kqp k ~ 2 + M^_^y-4 _... = c 
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is transformed by the substitution p = x + q/x into 

x k + ( q -) k = c. 



Hence equation (20) may be solved for p by radicals by the method employed in §43 
for a cubic equation. 

The above proof applies 3 without essential change to any equation x n + 
c\x n ~^ + • • • + c n = and leads to the following formula for the sum of the 



Mh powers c 


)f its roots: 








(21) 


* fc = *£(-ir + "-^^ 


■ + r n - 
•••r n \ 


"l)!n. 





where the summation extends over all sets of integers rq, . . . ,r n , each ^ 0, for 
which r\ + Ir^ + 3r^ + • • • + nr n = k. This result (21) is known as Waring 's 
formula and was published by him in 1762. 

Example. Let n = 3, k = 4. Then rq + 2r 2 + 3r 3 = 4 and 

(ri,r 2 ,r 3 ) = (4,0,0), (2,1,0), (1,0,1), (0,2,0), 

„,3! 4 2! 2 1! 1! 2 . 

54 = 4( 4! Cl " 2!l! ClC2 + ml" 1 " 3 + 2! C2) 
= c 4 -4c 2 C2 + 4cic 3 + 2c2. 



EXERCISES 

1. For the quadratic x 2 — px + q = write out the expressions for s 2 , S3, S4, 
■S5 given by (19), and compare with those obtained from Newton's identities (Ex. 3, 
§106). 

2. Find S4 for a quartic equation by Waring's formula. 

3. For k = 5, (20) becomes De Moivre's quintic p 5 — 5gp 3 + 5q 2 p = c. Solve it 
by radicals for p. 

4. Solve (20) by radicals when k = 7. 



3 See the author's Elementary Theory of Equations, pp. 72-74, where there is given also 
a shorter proof by means of infinite series. 
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108. S-functions Expressed in Terms of the Functions Sf~. Since we 
have learned two methods of expressing the s^ in terms of the coefficients, it is 
desirable to learn how to express any E-polynomial (and hence any symmetric 
function) in terms of the s^. 

By performing the indicated multiplication, we find that 

s a s b = £a? • Ea\ = Sa^ +6 + mZafa b 2 , 

where m = 1 if a ^ b, m = 2 if a = b. Transposing the first term, which is equal 
to s a+b , and dividing by m, we obtain 

(22) Za a ia b 2 = ^(s a s b -s a+b ). 

In order to compute £o|o2 a 3 in terms of the s^, we form the product 
y^4v^32 ^72,^63,^432 

Making three applications of (22), we get 

S4(s3^2 - «5) = (S7S2 - sg) + (s G S3 - s 9 ) + Safa^al- 

Hence 

£a;La2 a 3 = S2S3S4 — S2 S 7 ~~ S 3 S 6 ~~ S 4 S 5 + 2«9- 



EXERCISES 

For a quartic equation, express in terms of the Sk and ultimately in terms of the 
coefficients ci, . . . ,04: 

1. Yja\a\. 2. T,afa2- 3. £0^0203- 4. Yja\a\a\. 

5. If a ^ b > c > 0, prove that 

£010203 = —(s a SbS c - S a Sb+c ~ SbSa+c ~ S c S a+ b + 2s a +b+c), 

m 
where m = 1 if a > b, m = 2 if a = b. 

6. Eafa^az = \{s a sl - s a s 2 b - 2s b s a+b + 2s a+2 b), a > b > 0. 

7. Y,afa%a1 = ±(s 3 a - 3s a s 2a + 2s 3a ), a > 0. 
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109. Computation of Symmetric Functions. The method last ex- 
plained is practicable when a term of the E-function involves only a few dis- 
tinct roots, the largeness of the exponents not introducing a difficulty in the 
initial work of expressing the E-function in terms of the Sf,. 

But when a term of the E-function involves a large number of roots with 
small exponents, we resort to a method suggested by §104, which tells us 
which auxiliary simpler symmetric functions should be multiplied together to 
produce our E-function along with simpler ones. 

For example, to find T,x 2 X2XsX4, when n > 4, we employ 

E1E4 = Exi • EX1X2X3X4 = Y,X 1 X2X3X4 + 5EX1X2X3X4X5, 
Y,X 1 X2X3X4 = E1E4 — 5E$. 

To find Exfx^x 2 ^, employ E3E4 = EX1X2X3 • EX1X2X3X4. 

When many such products of E-functions are to be computed, it will save time 
in the long run to learn and apply the "method of leaders" explained in the author's 
Elementary Theory of Equations, pp. 64-65. 

MISCELLANEOUS EXERCISES 

Express in terms of the coefficients c±, . . . , c n : 

1. Eo^0203- 2. T,a 2 a 2 a^. 3. T,a 2 a 2 asa4. 4. Y,a 2 a 2 a 2 . 

If a, (3, 7 are the roots of x 3 + px 2 + qx + r = 0, find a cubic equation with the 
roots 

5. a 2 , (3\ 7 2 . 6. a/3, 07, M- 7. -, %, -. 

a p 7 

8. a 2 + /3 2 , a 2 + 7 2 , j3 2 + 7 2 . 9. a 2 + a(3 + /3 2 , etc. 

If a, (3, 7, 6 are the roots of x 4 + px 3 + qx 2 + rx + s = 0, find 

10. E^ = S /? + 7 + g = S-P^ = -4-pSl 

a a a a 

11. s4> • Use E- • E^ = E^ + 3E- + 2E^-. 

a a a a 1 a ap 

12. Express Ea"a2 a 3 Q 4 hi terms of the Sk when (i) a > b > c > d > 0, and 
(m) when a = b = c = d. 

13. By solving the first A: of Newton's identities (10) as a system of linear equa- 
tions, find an expression in the form of a determinant (i) for Sk in terms of ci, . . . , Cfc, 
and (m) for Ck in terms of 81, ... , Sfc. 



158 SYMMETRIC FUNCTIONS [Ch. IX 



14. One set of n numbers is a mere rearrangement of another set if s\, . . . ,s n 
have the same values for each set. 



CHAPTER X 

Elimination, Resultants And Discriminants 

110. Elimination. If the two equations 

ax + b = 0, ex + d = (a / 0, c / 0) 

are simultaneous, i.e., if x has the same value in each, then 

b d 

x = — = — , R = ad — be = 0, 

a c 

and conversely. Hence a necessary and sufficient condition that the equations 
have a common root is R = 0. We call R the resultant (or eliminant) of the 
two equations. 

The result of eliminating x between the two equations might equally well 
have been written in the form be — ad = 0. But the arbitrary selection of R 
as the resultant, rather than the product of R by some constant, as — 1, is 
a matter of more importance than is apparent at first sight. For, we seek a 
definite function of the coefficients o, 6, c, d of the functions ax + b, ex + d, 
and not merely a property R = or R / of the corresponding equations. 
Accordingly, we shall lay down the definition in §111, which, as the reader 
may verify, leads to R in our present example. 

Methods of elimination which seem plausible often yield not R itself, but 
the product of R by an extraneous function of the coefficients. This point (il- 
lustrated in §114) indicates that the subject demands a more careful treatment 
than is often given. 

111. Resultant of Two Polynomials in x. Let 

(1) ( f(x) = a x m + a 1 x m - 1 + --- + a m (a + 0), 

1 ' ' ' { g(x) = b x n + b x x n - X + • • • + b n (bo + 0) 

be two polynomials of degrees m and n. Let a\, . . . , am be the roots of f(x) = 0. 
Since u\ is a root of g(x) = only when g(a\) = 0, the two equations have a 
root in common if and only if the product 

g(a\)g(<X2) ■ • ■ g(a m ) 
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is zero. This symmetric function of the roots of f(x) = is of degree n in any 
one root and hence is expressible as a polynomial of degree n in the elementary 

symmetric functions (§104), which are equal to —a\/ao, 02/00, To be rid 

of the denominators 00, it therefore suffices to multiply our polynomial by clq. 
We therefore define 

(2) R(f,9) = aQg(a 1 )g(a 2 )---g(a m ) 

to be the resultant of / and g. It equals an integral rational function of 
ao, • • • , a m , bo, . . . , b n with integral coefficients. 

EXERCISES 

1. If m = 1, n = 2, R(f, g) = b§a\ — b\a§a\ + b 2 a\. 

2. If m = 2, n = 1, #(/,#) = a (6oai + 61) (600:2 + &i) = a &i - ai& 0i + a 2 &o, 
since 

ao(«i + 02) = — ai, aoaia2 = 02- 

3. If 0i, . . . , n are the roots of g{x) = 0, so that 

g(an) = b (ai - 0i)(oii - 2 ) ■ ■ ■ (a* - n ), 

then 

R(f,g) = a^( ai - /3i)(ai -&) • • • («i - /3„) 

•(a 2 - /?l)(«2 - 02) ■ ■ ■ («2 - /3n) 



•(a m - /?i)(a m - 2 ) ■ ■ ■ (a m - 0n)- 
Multiplying together the differences in each column, we see that 

R(f,g) = (-ir n b^f(0i)f(02)---f(0n) = (-l) mn R(g,f). 

4. If m = 2, n = 1, R(g,f) = b^f^-hi/bo) = a$b\ — aibobi + a 2 b^, which is 
equal to R{f,g) by Ex. 2. This illustrates the final result in Ex. 3. 

5. If m = n = 2, 

R(f,g) = alblajal + alb bia 1 a 2 (ai + a 2 ) 

+ 00^0^2(0? + ot 2 ) + a\b\a\a 2 + Oo6i6 2 (ai + "2) + a\b\ 

00 / 9 \ 9 9 9 

— b a 2 - bobia\a 2 + 60^2(^1 - 2aoa2J + b^o^ - b\b 2 aoai + a o 2 - 
This equals R{g, /), since it is unaltered when the a's and b's are interchanged. 

6. Prove by (2) that R is homogeneous and of total degree m in bo, . . . , b n ; and 
by Ex. 3, that R is homogeneous and of total degree n in ao, • • ■ , a m . Show that R 
has the terms a£6™ and (-l) mn 6g*o^. 

7. R(f, gi g 2 ) = R(f, gi )-R(f,g 2 ). 

8. i?(/,x") = (-l)™i*(z n ,/) = {-l) mn a n m . 
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112. Sylvester's Dialytic Method of Elimination. 1 Let the equa- 
tions 

f(x) = oqx + a\x + a^x + 03 = 0, g(x) = box + b\x + 62 = 

have a common root x. Multiply the first equation by x and the second by x 2 
and x in turn. We now have five equations 



4 3 2 

aox + aix + a2X + a3X 

3 2 

aox + aix + a2X + 03 

6 x 4 + &ix 3 + 6 2 ^ 2 

&qx + &ix + b 2 x 



box + &ix + 62 



which are linear and homogeneous in x 4 , x 3 



x , x 



, 1. Hence (§97) 



(3) 



F 



(iq a\ a2 G13 

ciq a\ 02 as 

b 61 6 2 

6 h b 2 

6 61 b 2 



must be zero. Next, if F = 0, there exist (§97) values which, when substituted 
for x 4 , x 3 , x 2 , x and 1, satisfy the five equations. But why is the value for x 4 
the fourth power of the value for x, that for x 3 the cube of the value for x, 
etc.? Since the direct verification of these facts would be very laborious, we 
resort to a device to show that, conversely, if F — the two given equations 
have a root in common. 

In (3) replace 03 by 03 — z and consider the equation 



(4) 



a ( ) 


01 


a.2 


as~ z 








oo 


ai 


2 


03 - z 


bo 


bi 


b-2 











bo 


h 


^2 











bo 


h 


b 2 



0. 



To prove that it has the roots /(/?i) and /(/?2), where /?i and /?2 are the roots 
of g(x) = 0, we take z = f(/3i) and prove that the determinant is then equal to 
zero. For, if we add to the last column the products of the elements of the first 
four columns by /3 4 , /3 3 , /3 2 , /%, respectively, we find that all of the elements of 
the new last column are zero. 



^^Given without proof by Sylvester, Philosophical Magazine, 1840, p. 132. 
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Since (4) reduces to (3) for z = 0, it is of the form 

biz 2 + kz + F = 0, 

in which the value of A; is immaterial. By considering the product of the roots 
of this quadratic equation, we see that 

F = b s f(p 1 )f(f3 2 ). 

Hence the Sylvester determinant F is the resultant R(g, f) and hence is the 
resultant R(f,g), since mn is here even (Ex. 3, §111). 
In general, if the equations are 



fix) = a x m + 



+ a r 



0, 



g(x) = b x n + ■ ■ ■ + b n = 0, 



„7l— 1 n.Tl—1 



we multiply the first equation by x 

, x 
linear and homogeneous in the m + n quantities x m+n ~ 1 
determinant 



, . . . , x, 1, in turn, and the sec- 



ond by x m 1 , x m 2 , . . . , x, 1, in turn. We obtain n + m equations which are 

, x, 1. Hence the 



(5) 



F 



ao a l a 2 

ao a\ ci2 ■ ■ 
oq cl\ a2 







a m 
• • • am, 



ao a\ a2 • • ■ 

bo b\ b n 

b b\ b n 







a m 










b &i 6 r> 



> n rows 



> m rows 



is zero. It may be shown to be equal to the resultant R(f,g), whether mn is 
even or odd, by the method employed in the above case m = 3, n = 2. 

We may also prove as follows that if F = the equations / = and 
g = have a common root. Since F was obtained as the determinant of the 
coefficients of 



„n-l 



777 — 1 

x g,...,xg,g, 



x j, . . . , xj , / , 
F = implies, by §96, Lemma 2, the existence of a linear relation 

Box* 1 ' 1 / + ■■■ + B n _ 2 xf + B n _if + Aox m - l g +■■■ + A m _ 2 xg + A m _ l9 = 0, 

identically in x, with constant coefficients Bq, . . . ,A m _i not all zero. In other 
words, (3f + ag = 0, where 



(6) a = Aq. 



jm—\ 



+ h ^m-2^ + A 



m— 1) 



_n-l 



/? = S x n + • • • + J B n _ 2 x + S n _i. 
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Neither a nor (3 is identically zero. For, if a = 0, for example, then (3f = 
and /? = 0, whereas the A^ and Sj are not all zero. 

Consider the factored forms of /, g, a, (3. Suppose that / and g have no 
common linear factor. The highest power of each linear factor occurring in 
/ divides ag = —/3f and hence divides a. Thus / divides a, whereas / is of 
higher degree than a. Hence our assumption that / = and g = have no 
common root has led to a contradiction. 

A similar idea is involved in the method of elimination due to Euler (1707 
1783). If / = and g = have a common root c, then / = (x — c)a, —g = (x — c)/3, 
identically in x, where a and /3 are polynomials in x of degrees m — 1 and n — 1, 
respectively. Give them the notations (6). In the identity (5f + ag = 0, the coefficient 
of each power of x is zero. Hence 



ao-Bo 

aiB + a Bi 



+ b A 

+ 6iA + Mi 



Om-Bn-2 + a m -iB, 



n-1 
n-1 



+ b n A m _2 + b n -iA m _i— 
+ b n A m _ 1 = 0. 



Since these m + n linear homogeneous equations in the unknowns Bq, . . . ,B n _i, 
Aq, . . . , A m -\ have a set of solutions not all zero, the determinant of the coefficients 
is zero. By interchanging the rows and columns, we obtain the determinant (5). 



1. For m 



n 



EXERCISES 



2, show that the resultant is 



R 



a a l a 2 

ClQ CL\ (12 

b 6i 6 2 

b b\ b 2 



Interchange the second and third rows, apply Laplace's development, and prove that 



R = (a b 2 ) 2 - (a h)(aib 2 ), 
where (00^2) denotes aob 2 — a 2 bQ, etc. 

2. For m = n = 3, write down the resultant R and, by interchanges of rows, 
derive the second determinant in 



R 



(l() 


fll 


a 2 


0,3 










oo 


01 


o 2 


03 











0-0 


01 


2 


3 







bo 


bi 


b 2 


63 














0,0 


Ol 


2 


0,3 







Oo 


a 1 


02 


03 





60 


hi 


b 2 


h 













60 


61 


6 2 


b 3 








bo 


&i 


b 2 


63 













o 


Ol 


2 


0,3 








bo 


bi 


b 2 


&3 










60 


&i 


62 


63 
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To the second determinant apply Laplace's development, selecting minors from the 
first two rows, and to the complementary minors apply a similar development. This 
may be done by inspection and the following value of — R will be obtained: 

(a bi){(a 1 b2)(a 2 b 3 ) - (aib 3 ) 2 + (a 2 b 3 )(a b 3 )} 
-(a b 2 ){(a b 2 )(a 2 b 3 ) - {aob 3 )(a 1 b 3 )} 
+ (a b 3 ){(a bi)(a 2 b 3 ) - (a b 3 ) 2 }. 

The third term of the first line and the first term of the last line are alike. Hence, 
changing the signs, 

R = (aoM 3 - 2(a b 1 )(a b 3 )(a 2 b 3 ) - (a b 2 )(a b 3 )(aib 3 ) 

+ (a b 2 ) 2 (a 2 b 3 ) + (a &i)(ai& 3 ) 2 - (a bi)(aib 2 )(a 2 b 3 ). 

Other methods of simplifying Sylvester's determinant (5) are given in §113. 

113. Bezout's Method of Elimination. When the two equations are 
of the same degree, the method published by Bezout in 1764 will be clear from 
the example 

Q 9 ^9 

/ = ciqx + a\x + a 2 x + 03 = 0, g = b^x + b\x + b^x + 63 = 0. 

Then 

aog - bof, 

(7) (aQX + a{)g- (box + bi)f, 
(a x 2 + a\x + a 2 )g - (b x 2 + b\x + b 2 )f 

are equal respectively to 

(a 6i)a; 2 + (a b 2 ) x + (a bs) = 0, 

(8) (a b 2 )x 2 + {(a b s ) + ( ai b 2 )}x + (0163) = 0, 
{a bs)x 2 + (0163) x + (a 2 b s ) = 0, 

where (ao&i) = ao^l — a l^0i e ^ c - The determinant of the coefficients is the 
negative of the resultant R(f,g). Indeed, the negative of the determinant is 
easily verified to have the expansion given at the end of Ex. 2 just above. 

To give a more instructive proof of the last fact, note that, by (7), equations (8) 
are linear combinations of 

x 2 f = 0, xf = 0, / = 0, x 2 g = 0, xg = 0, g = 0, 

the latter being the equations used in Sylvester's method of elimination. The deter- 
minant of the coefficients in these six equations is the first determinant R in Ex. 2 
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just above. The operations carried out to obtain equations (8) are seen to corre- 
spond step for step to the following operations on determinants. To the products of 
the elements of the fourth row by ao add the products of the elements of the 1st, 
2nd, 3rd, 5th, 6th rows by — bo, —b\, —b2, Oi, 02 respectively [corresponding to the 
formation of the third function (7)]. To the products of the elements of the fifth row 
by ao add the products of the elements of the 2nd, 3rd, 6th rows by — bo, —b\, a\ 
respectively [corresponding to the second function (7)]. Finally, to the products of 
the elements of the sixth row by ao add the products of the elements of the third 
row by — bo [corresponding to aog — bof]. Hence 



o-o 


Ol 


o 2 


fl 3 











«0 


0,1 


0,2 


0,3 


















a 





Ol 

(0063) 
(aob 2 ) 

(a 6i) 


a-2 

(0163) 

(a b 3 ) + (0162) 

(aofe) 


a$ 
(0203) 
(0163) 

(a b 3 ) 



c$R 



so that R is equal to the 3-rowed minor enclosed by the dots. The method of Bezout 
therefore suggests a definite process for the reduction of Sylvester's determinant of 
order 2n (when m = n) to one of order n. 

Next, for equations of different degrees, consider the example 



/ = aox + a\x + a 2 x + a 3 x + 04, 



g = b x + b 1 x + b 2 . 



Then 



a x 2 g 



b f, (aox + a{)x g -(b x + bi)f 



are equal respectively to 

(aobi)x 3 + (aob 2 )x 2 - a 3 box - 0460, 

(a b2)x 3 + {(aio 2 ) - a 3 b }x 2 - {a 3 6i + a 4 b }x - a 4 &i 



The determinant of the coefficients of x 3 



x, 1 in these two functions and xg, 



g, after the first and second rows are interchanged, is the determinant of order 4 
enclosed by dots in the second determinant below. Hence it is the resultant R(f,g). 
As in the former example, we shall indicate the corresponding operations on 
Sylvester's determinant 

a a i CL2 03 04 



R 



Multiply the elements of the third and fourth rows by ao- In the resulting determi- 
nant a^R, add to the elements of the third row the products of the elements of the 






Of) 


04 


0,2 


0,3 


0,4 


bo 


bi 


b 2 














bo 


61 


02 














b 


6l 


b 2 














bo 


bi 


b 2 
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first, second and fourth rows by — bo, — b\, a\/ao respectively. Add to the elements 
of the fourth row the products of those of the second by — bo- We get 



a 2 Q R 



<l() 


0,1 


02 


0,3 


0,1 








a 


Ol 


0.2 


o 3 


o-i 








(aob 2 ) 


(0162) - «3^0 


-a 3 6i - a A b 


— 0461 








(0061) 


(0062) 


-a 3 b 


-0460 








bo 


6l 


b 2 














bo 


61 


?>2 



Hence i? is equal to the minor enclosed by dots. 



EXERCISES 



1. For m = 3, n = 2, apply to Sylvester's determinant R exactly the same 
operations as used in the last case in §113 and obtain 



R 



(00^2) 

(ao&i) 

bo 



(0162) - a 3 &o 
(oofo) 

01 



-0361 

-a 3 6 
62 



2. For m 



n 



4, reduce Sylvester's R (as in the first case in §113) to 



(o 6i) (00^2) 

(00^2) (0.0^3) + (0162) 

(00^3) (aoM + (0163) 

(a 6 4 ) (0164) 



(a O 3 ) (aoM 

(a 6 4 ) + (0163) (0164) 

(0164) + (02^3) (02^4) 

(a 2 6 4 ) (03^4) 



114. General Theorem on Elimination. If any method of eliminat- 
ing x between two equations in x leads to a relation F = 0, where F is a 
polynomial in the coefficients, then F has as a factor the true resultant of the 
equations. 

Some of the preceding proofs become simpler if this theorem is applied. For 
example, determinant (3) is divisible by the resultant R. Since the diagonal 
term of (3) is a term a^b^ of R (Ex. 6, §111), F is identical with R. 

The preceding general theorem is proved in the author's Elementary The- 
ory of Equations, pp. 152-4. We shall here merely verify the theorem in an 
instructive special case. Let 



f = aQX + a\x + a 2 x + 03 



0, 



g = box 3 + bix 2 + b 2 x + 63 = 



have a common root x / 0. Then 

-bof + a g = {a$bi)x 2 + (a b 2 )x + (a b 3 ), 
(hf ~ as9)/x = (aob 3 )x 2 + (0163)0; + (0263)- 
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By Ex. 1 of §112, the resultant of these two quadratic functions is 

2 



(a b 3 ) (a &i) 
(a 2 6 3 ) (a 6 3 ) 



(0063) (aoh) 
(0163) (a fr 2 ) 



(ai& 3 ) (a & 2 ) 
(a 2 6 3 ) (ao&3) 



This is, however, not the resultant R of the cubic functions /, g. To show that 
(0063) is an extraneous factor, note that the terms of F not having this factor 
explicitly are 

(ao6i)(a 2 &3){(ao&l)(o2^3) - (ao&2)(«l&3)}- 
The quantity in brackets is equal to — (ao&3)(ai& 2 ), since, as in Ex. 2 of §101, 



(ao6i)(a 2 6 3 ) - (a 6 2 )(ai& 3 ) + (a &3)(ai6 2 ). 



We now see that F = (aobs)R, where R is given in Ex. 2 of §112. This method 
of elimination therefore introduces an extraneous factor (0063). The student 
should employ only methods of elimination (such as those due to Sylvester, 
Euler, and Bezout) which have been proved to lead to the true resultant. 

EXERCISES 

Find the result of eliminating x and hence find all sets of common solutions of 





O-O 


a 1 


a 2 


a 3 


1 


bo 


h 


b 2 


h 


2 


0-0 


a 1 


a 2 


a 3 




bo 


h 


b 2 


^3 



1. x 2 



y 



9, xy = by. 



2. x 2 + y 2 = 25, x 2 + 3(c - l)x + c(y 2 - 25) = 0. 

3. When x 2 + ax + b = has a double root, what 3-rowed determinant is zero? 

4. Find the roots of x 6 + 3x 4 + 32x 3 + 67x 2 + 32a; + 65 = by §79. 



115. Discriminants. Let a\, . . . , a m be the roots of 



(9) 

so that 



f(x) = aox m + a\x 



m—l 



-\ Va r 



(00 / 0), 



(10) /(x) = ao(x - ai)(x - a 2 ) • • • (x - a m ). 

As in §44, we define the discriminant of (9) to be 



D = a (ai - a 2 ) (01-03) • • • (ai - a m ) (a 2 - 03)' 



(%-l - «m)' 
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Evidently D is unaltered by the interchange of any two roots. Since the degree 
in any root is 2(m — 1), the symmetric function D is equal to a polynomial in 
ao, • • • , a m . Indeed, Oq is the lowest power of ao sufficient to cancel the de- 
nominators introduced by replacing S«i by — ai/ao, . . . , ai«2 • • • a m by ±a m /ao- 
By differentiating (10), we see that 

f'(ai) = oo(«i - «2)(ai - "3) • • • (<*l - «m), 
/'(a 2 ) = ao(a2 - ai)(«2 - 03) • • • («2 - %)> 
/'(a 3 ) = a (a 3 - ai)(a 3 - 02X03 - "4) • • • ("3 - %), 

etc. Hence 



ar 1 /'^!) • • • /'(a m ) = a§ m - 1 (-l) 1+2+ - +m - 1 (ai - a 2 ) 2 • • • (a m _i - a r 

m{rn— 1) 

= (-1)— 2— aoI>. 

By (2), the left member is the resultant of /(a;), f'(x). Hence 

m(m— 1) ] 



(11) 



D = (-l) 



«() 



-#(/,/')■ 



EXERCISES 

1. Show that the discriminant of / = y 3 +py + q = is — 4p 3 — 27g 2 by evaluating 
the determinant of order five for R(f, /'). 

2. Prove that the discriminant of the product of two functions is equal to the 
product of their discriminants multiplied by the square of their resultant. Hint: use 
the expressions in terms of the differences of the roots. 

3. For oq = 1, show that the discriminant is equal to 



1 a.\ a\ 

1 02 Oo 



1 Or 



07 



a 



m—l 
1 

m—l 
2 

m—l 
m 



«0 


Si 


S2 


Sm— 


1 


Si 


Si 


S3 


Sm 




Sm— 1 


Sfn 


Sm+1 " 


■ S2m- 


-2 



where s { = a\ H h a { m . See Ex. 4, §88; Ex. 2, §102. 

4. Hence verify that the discriminant of x 3 + px + q = is equal to 



3 -2p 

-2p -3<j 

-2p -3g 2p 2 



-4p 3 - 27g 2 . 
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5. By means of Ex. 1, §113, show that the discriminant of aox 3 +aix 2 +a2X+a^ = 



is 



2oofl2 a\ci2 + 3ao<23 2ai<Z3 
a\ 2a2 3a3 

3ao 2ai 02 



O o r\ r\ 

18aoaia2ffl3 — 4ao<X2 ~~ 4a 1 03 + a 1 a 2 — 27a a 3 . 



MISCELLANEOUS EXERCISES 

1 . Find the equation whose roots are the abscissas of the points of intersection 
of two general conies. 

2. Find a necessary and sufficient condition that 

f(x) = x + px + qx + rx + s = 

shall have one root the negative of another root. When this condition is satisfied, 
what are the quadratic factors of f{x)l Apply to Ex. 4, §74. Hint: add and subtract 
f(x) and f(-x). 

3. Solve f(x) = x 4 — 6x 3 + 13x 2 — 14a; + 6 = 0, given that two roots a and (5 
are such that 2a + (3 = 5. Hint: f{x) and /(5 — 2x) have a common factor. 

4. Solve x 3 + px + q = by eliminating x between it and x 2 + vx + w = y by 
the greatest common divisor process, and choosing v and tu so that in the resulting 
cubic equation for y the coefficients of y and y 2 are zero. The next to the last step 
of the elimination gives x as a rational function of y. (Tschirnhausen, Acta Erudit., 
Lipsiae, II, 1683, p. 204.) 

5. Find the preceding y-cubic as follows. Multiply x 2 + vx + w = y by x and 
replace x 3 by — px — q; then multiply the resulting quadratic equation in x by x and 
replace x 3 by its value. The determinant of the coefficients of x 2 , x, 1 must vanish. 

6. Eliminate y between y 3 = v, x = ry + sy 2 , and get 

x — 3rsvx — (r v + s' v ) = 0. 

Take s = 1 and choose r and v so that this equation shall be identical with x 3 + 
px + q = 0, and hence solve the latter. (Euler, 1764.) 

7. Eliminate y between y 3 = v, x = f + ey + y 2 and get 



1 e / — x 

e f — x v 
f — x v ev 



0. 



This cubic equation in x may be identified with the general cubic equation by choice 
of e, /, v. Hence solve the latter. 
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8. Determine r, s and v so that the resultant of 

o x + r 

V = v, y= — ■ — 

y + s 

shall be identical with x 3 + px + q = 0. (Bezout, 1762.) 

9. Show that the reduction of a cubic equation in x to the form y 3 = v by the 

substitution 

r + sy 
x = 

i + y 

is not essentially different from the method of Ex. 7. [Multiply the numerator and 
denominator of x by 1 — y + y 2 .] 

10. Prove that the equation whose roots are the n{n — 1) differences Xj — Xk of the 
roots of f{x) = may be obtained by eliminating x between the latter and f(x+y) = 
and deleting from the eliminant the factor y n (arising from y = Xj — Xj = 0). The 
equation free of this factor may be obtained by eliminating x between f(x) = and 

n— 1 

{/(* + y)- f(x)}/y = f{x) + /"(x)^ + ' ' ' + f in) ( x \ V 2 ... n = °' 

This eliminant involves only even powers of y, so that if we set y 2 = z we obtain 
an equation in z having as its roots the squares of the differences of the roots of 
f(x) = 0. (Lagrange Resolution des equations, 1798, §8.) 

11. Compute by Ex. 10 the z-equation when f(x) = x 3 + px + q. 



APPENDIX 



THE FUNDAMENTAL THEOREM OF ALGEBRA 

THEOREM. An equation of degree n with any complex coefficients 

f(z) = z n + a 1 z n - 1 + --- + a n = 

has a complex (real or imaginary) root. 

Write z = x + iy where x and y are real, and similarly a\ = c\ + id\, etc. By 
means of the binomial theorem, we may express any power of z in the form 
X + iY. Hence 

(1) f(z) = (j)(x,y) + itp(x,y), 

where <f> and V are polynomials with real coefficients. 

The first proof of the fundamental theorem was given by Gauss in 1799 
and simplified by him in 1849. This simplified proof consists in showing that 
the two curves represented by <f>(x, y) = and ij){x, y) = have at least one 
point (xi, yi ) in common, so that z\ — x\ + iy\ is a root of f(z) = 0. This proof 
is given in Chapter V of the author's Elementary Theory of Equations. 

We here give a shorter proof, the initial idea of which was suggested, but 
not fully developed, by Cauchy. 1 

LEMMA 1. a\h + a^h 1 + ••• + a n h n is less in absolute value than any 
assigned positive number p for all complex values of h sufficiently small in 
absolute value. 

The proof differs from that of the auxiliary theorem in §62 only in reading 
"in absolute value" for "numerically." 

We shall employ the notation \z\ for the absolute value +\/x 2 + y 1 of z = 
x + iy. 



^^For a history of the fundamental theorem, see Encyclopedic des sciences mathematiques, 
tome I, vol. II, pp. 189-205. 
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LEMMA 2. Given any positive number P, we can find a positive num- 
ber R such that \f(z)\ > P if \z\ ^ R. 

The proof is analogous to that in §64. We have 

J 77 



f(z) = z n (l + D), D = ai - +--- + a 



Since (Ex. 5, §8) the absolute value of a sum of two complex numbers is equal 
to or greater than the difference of their absolute values, we have 

\f(z)\^\z\ n [l-\D\]. 

Let p be any assigned positive number < 1. Applying Lemma 1 with h 
replaced by 1/z, we see that \D\ < p if \l/z\ is sufficiently small, i.e., if p = \z\ 
is sufficiently large. Then 

\f(z)\>p n (l-p)ZP 

if p n ^ P/{1 — p), which is true if 



^\/r^ sR 



This proves Lemma 2. 



LEMMA 3. Given a complex number a such that f(a) / 0, we can find a 
complex number z for which \f(z)\ < \f(a)\. 

Write z = a + h. By Taylor's theorem (8) of §56, 

f(a + h) = /(a) + f{a)h +■■■ + /W(o) — + ••• + / (n) («) • —,- 

r\ nl 

Not all of the values /'(a), /"(a), . . . are zero since f( n \a) = nl. Let f( r >(a) be 
the first one of these values which is not zero. Then 

f(a + h) _ f< r \a) tf, f {n \a) h™_ 

f(a) f(a) ' r! " ' f(a) ' n! ' 

Writing the second member in the simpler notation 

g(h) = 1 + bh r + ch r+1 + • • • + lh n , 6/0, 

we shall prove that a complex value of h may be found such that \g(h)\ < 1. 
Then the absolute value of f(z)/f(a) will be < 1 and Lemma 3 proved. To find 
such a value of h, write h and b in their trigonometric forms (§4) 

h = p(cos9 + ism 9), b = |6|(cos/5 + isin/3). 
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Then by §5, §7, 

bh r = \b\p r {cos(p + rO) + isin(p + r0)}. 

Since h is at our choice, p and angle 9 are at our choice. We choose 9 so that 
b + r9 = 180°. Then the quantity in brackets reduces to —1, whence 

g(h) = (1 - \b\p r ) + h r (ch +■■■ + lh n ~ r ). 

By Lemma 1, we may choose p so small that 

\ch+--- + lh n ~ r \ < \b\. 

By taking p still smaller if necessary, we may assume at the same time that 
\b\p r < 1. Then 

|<7(/0|<(l-|&|/) + /|6|, \g(h)\<i. 

Minimum Value of a Continuous Function. Let F(x) be any poly- 
nomial with real coefficients. Among the real values of x for which 2 ^ x ^ 3, 
there is at least one value x\ for which F(x) takes its minimum value F(xi), 
i.e., for which F(x\) ^ F(x) for all real values of x such that 2 ^ x ^ 3. This 
becomes intuitive geometrically. The portion of the graph of y = F(x) which 
extends from its point with the abscissa 2 to its point with the abscissa 3 either 
has a lowest point or else has several equally low points, each lower than all 
the remaining points. The arithmetic proof depends upon the fact that F(x) 
is continuous for each x between 2 and 3 inclusive (§62). The proof is rather 
delicate and is omitted since the theorem for functions of one variable x is 
mentioned here only by way of introduction to our case of functions of two 
variables. 

We are interested in the analogous question for 

G(x, y) = <j) 2 {x, y) + ip 2 (x, y), 

which, by (1), is the square of |/(-?)|. As in the elements of solid analytic 
geometry, consider the surface represented by Z = G(x, y) and the right circular 
cylinder x 2 + y 2 = R 2 . Of the points on the first surface and on or within their 
curve of intersection there is a lowest point or there are several equally low 
lowest points, possibly an infinite number of them. Expressed arithmetically, 
among all the pairs of real numbers x, y for which x 2 + y 2 ^ R 2 , there is 2 at 
least one pair xi, y\ for which the polynomial G(x,y) takes a minimum value 
G(x\,yi), i.e., for which G(xi,yi) ^ G(x,y) for all pairs of real numbers x, y 
for which x 2 + y 2 ^ R 2 . 



2 Harkness and Morley, Lntroduction to the Theory of Analytic Functions, p. 79, prove 
that a real function of two variables which is continuous throughout a closed region has a 
minimum value at some point of the region. 
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Proof of the Fundamental Theorem. Let z' denote any complex num- 
ber for which f(z') / 0. Let P denote any positive number exceeding \f(z')\. 
Determine R as in Lemma 2. In it the condition \z\ ^ R may be interpreted 

geometrically to imply that the point (x, y) representing z = x + iy is outside 
or on the circle C having the equation x 2 + y 2 = R 2 . Lemma 2 thus states that, 
if z is represented by any point outside or on the circle C, then |/(z)| > P. 
In other words, if \f(z)\ ^ P, the point representing z is inside circle C. In 
particular, the point representing z' is inside circle C. 

In view of the preceding section on minimum value, we have 

G( Xl , yi )^G(x,y) 

for all pairs of real numbers x, y for which x 1 + y 1 ^ R 2 , where x\, y\ is one 
such pair. Write z\ for x\ + iy\. Since |/(^)| 2 = G(x,y), we have 

i/(*oi ^ \m\ 

for all z's represented by points on or within circle C. Since z' is represented 
by such a point, 

(2) \f( Z1 )\^\f(z')\<P. 

This number z\ is a root of f(z) = 0. For, if f(z\) / 0, Lemma 3 shows 
that there would exist a complex number z for which 

(3) \f(z)\ < \f( Zl )\. 

Then \f(z)\ < P by (2), so that the point representing z is inside circle C, as 
shown above. By the statement preceding (2), 

l/(*l)l S \f(z)\. 
But this contradicts (3). Hence the fundamental theorem is proved. 
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14. p 3 r = q 3 . 15. 2, 4, -6. 
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10. See (11), §32. 

11. Edges roots of x 3 — 7x 2 + Ylx — v = 0, all real (§45) and irrational. 

14. A = area, c = hypotenuse, squares of legs \{(? i Vc 4 — 16A 2 ). 

15. A area, a, 6 given sides, square third side is a 2 + b 2 ± 2Va 2 b 2 — 4A 2 . 

16. y 4 - 2y 3 + (2 - # 2 )y 2 — 2j/ + 1 = 0, pos. roots 0.09125, 10.95862. 
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1. -5, £(5± v 73 ^)- 2. -6, ±V^3. 

3. -2, lit. 4. |, |(-2± v 73 ^)- 

Page 54 

1. A = -400, one. 2. A = 4 • 27 • 121, three. 

3. A = 0, two. 4. A = 0, two. 

Page 55 

1. -4, 2 i VS. 2. See Ex. 1, §47. 
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3. 1.3569, 1.6920, -3.0489. 4. -1.201639, 1.330058, -3.128419. 
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1. 1, -1, 4±v / 6. 2.-1,-2,2,3. 3. lit, -l±\/2. 

4. 1 i \/2, -1 i v /z 2. 5. 4, -2, -1 i ». 

Page 61 

1. (-3,9). 2. A = -250000, x = 3, -2, ±*. 

3. (3,9), (-2,4). 4. ft = 3. 5. 6.856,7. 

Pages 67 68 

2. 2.1. 

3. (-0.845,4.921), (-3.155,11.079); between -4 and -5. 

4. 1.1, -1.3. 

5. Between and 1, and — 1, 2.5 and 3, —2.5 and —3. 
9. 120(x 3 + j;), 120a; 2 -42. 

Pages 69 70 

1. 3. 2. 2, -2. 3. -1. 

4. Double roots, 1, 3. 5. None. 6. 3, 3, -3, 6. 

Page 72 

3. Use Ex. 3, p. 62, abscissas — 1, 3. 4. Use Ex. 2, p. 62. 

6. y = -15x - 7, X 3 - 15X + 23 = 0. 
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Pages 73 74 



1. One real. 2. (±J |, 7 t " V I)' three reaL 

3. (iVf, -1 T |a/|), three. 4. (-2 ± A 23 t 10^5), one. 



Pages 84 85 

13. y 5 + 2y 4 + 5y 3 + 3y 2 - 2y - 9 = 0. 

14. y 3 + 15y 2 + 52y - 36 = 0. 

Page 88 

1. One, between —2 and —3. 2. One, between 1 and 2. 

Pages 89 90 

1. (-4,-3), (-2,-1), (1,2). 2. (-2,-1), (0,1). 
3. (-2,-1.5), (-1.5,-1), (3,4). 4. (-2,-1), (0,1). 
5. (-7,-6), (1,2). 6. (0,1), (3,4). 

Pages 92 93 

2. 1, 1, 1, 2. 3. 1, 1, -2, -2. 4. 1, 1, two imaginary. 

Page 95 

1. (-2,-1), (0,1), (1,2). 2. (-4,-3), (-2,-1), (1,2). 

Pages 100 101 

1. Single, -2.46955. 2. -1.20164, 1.33006, -3.12842. 
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3. 1.24698, -1.80194, -0.44504. 
5. 3.45592, 21.43067. 



4. ±2.1213203, 2.1231056, -6.1231056. 
6. 2.15443. 



7. -1.7728656, 1.1642479, -3.3913823. 

8. 3.0489173, -1.3568958, -1.6920215. 

9. 2.24004099. 10. 1.997997997. 



11. 1.094551482. 
13. 1.2261. 
15. 0.9397. 
18. 5.46%. 



1. 2. 



12. 2.059, -1.228. 

14. 0.6527 = reciprocal of 2 cos 40°. 
16. 1.3500. 17. 2.7138, 3.3840. 

19. 5.57%. 20. 9.70%. 

Page 105 

2. 3. 

Pages 106 108 



1. 2.24004099. 



2. 2.3593041. 



3. 1.997998. 



Pages 109 110 



1. 132°20.7 / . 



2. 157°12 / . 



3. 4.8425364. 4. 3.1668771. 



5, 7. 15°16^, 85°56^, 212°49 / , 225°57 / . 

6. 72°17 / . 



8. 5°56^, 25°18 / . 9. 2.5541949. 10. 1.85718. 



Page 111 



-1.04727 ±1.13594i. 



f ± }V3i 
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3. -l±i. 4. 1±», l±2t. 5. 2±t, ±2i. 

Pages 111 113 

1. 217°12 / 27.4 // = 3.790988 radians. 2. 42°20 / 47|" doubled. 
3. 133°33.8 / . 4. 108°36 / 14 // . 5. 21.468212. 

6. Angle at center 47°39 / 13 // . 7. 49°17 / 36.5 // . 

8. 1.4303tt, 2.4590tt, 3.4709tt; 257°27 / 12.225 // more exact than first. 

9. x/tt = 0.6625, 1.891, 2.930, 3.948, 4.959. 

10. (i) 0.327739, 0.339224, 1124.333037. 

(ii) 0.250279, 0.894609, 1.127839. 

[Set x = 1 + y, y = 1/z and solve by trigonometry.] 

11. 3.597285. 12. 10, 1.371288. 
13. 0.326878, 12.267305. 14. 324°16 / 29.55 // . 

15. 10 yr. 4 mo. days. 16. 6.074674. 17. 6.13%. 

Page 116 

1. x = 5, y = 6. 2. x = 2, y = 1. 3. x = a, y = 0. 

Page 120 

1. -<Z2&iC3d4 + a2&iC4<i3 + a2^3Cl^4 - a2&3C4^1 ~ 02^4Cl^3 + a2^4C3^1- 

2. +, +. 

Page 126 

3. -3. 4. -8. 
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Pages 129-130 

1. x = —8, y = —7, z = 26. 2. x = 3, y = —5, z — 2. 

3. x = 6, y = 3, z = 12. 4. x = 5, y = 4, z = 3. 

5. x = —5, y = 3, 2 = 2, w = 1. 6. x = 1, y = z = 0, w = — 1. 

Page 133 

1. Consistent: y = —8/7 — 2x, z = 5/7 (common line). 

2. Inconsistent, case (/?). 3. Inconsistent (two parallel planes). 

4. Consistent (single plane). 

5. (i) z = — x — y — 2. (ii) inconsistent. (iii) 



X 


a- 1 


-3 


a + 2' 


y " a + 2 ' 


a - 


" k if fc 


= a or fc = c. 



_ ,., (k-b)(c-k) ,.., fc-c 

6. (l) x = — r. (n) y = x, z 

{a — b){c — a) a — c a — c 

but inconsistent if k is different from a and c. (iii) z = 1 — x — y if k = a, 
inconsistent if k ^ a. 



Page 134 

1. r = 2, x : y : z = —4 : 1 : 1. 2. r = 2, x : y : z = —10 : 8 : 7. 

3. r = 1, two unknowns arbitrary. 4. r = 3, x : y : z : -u> = 6 : 3 : 12 : 1. 

5. r = 2, z = -^-x - ^y, w = ~^§x - ^-y. 

Page 136 

1. Ranks of A and 5 are 2; y = -8/7 - 2x, z = 5/7. 

5 3 45 

2. Consistent only when a = —225/61 and then x = , y = — , z = — . 

J ' 61 61 61 

3. Rank of A is 2, rank of B is 3, inconsistent. 

4. A and B of rank 2, x = 3, y = 2. 
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Pages 140 142 

k(b-k)(c-k)(k + b+c) 

1. x = — —, — ; -, it a, b, c are distinct and not zero and their 

a(b — a)(c — a) (a + b + c) 

sum / 0. If a = b / c, ac / 0, equations are inconsistent unless k — 0, a, c, 

or -a - c, and then y = — - x, z = — f, x arbitrary. 

a(c — a) c(c — a) 

3. (a — b)(b — c)(c — a). 4. (x — y)(y — z)(z — x){xy + yz + zx). 



6. (a + b + c + d)(a + b - c - d)(a -b- c + d)(a -b + c- d). 

7. (a + b + c + d)(a — b + c — d)(a + bi — c — di)(a — bi — c + di) . 



11. Xj = (ki - aj) ■■■(k n - aj) -=- Y[( a s - a j)- 

12. x{ab + ac + be) = —abc. 



s=l 

s+3 



Pages 148 149 

p 4 - 3p 2 q + 5pr + q 2 (5p 2 - 12q)(p 2 - Aq) 13 

J- . . ^. — q T — P' 

r — pq A(p 6 — Apq + 8r) 4 

5. 2p 2 -2q. 6. 24r-p 3 . 

3p 2 g 2 - 4p 3 r - 4g 3 - 2pqr - 9r 2 8. 27r 2 - 9pqr + 2q 3 = 0. 

(r — pg) 2 

10. y = q + r/x. H. x = Ll^M. 

2 + 2y 

12. „ = 4x2+PX + g , see §112. 13. 2g( f + ' M " r) - 5p, see Ex. 17. 

—3a; — p p z q — pr + s 

Page 151 

3. s 2 =p 2 -2q, 4. s 5n = 5.3 n , 

s 3 = P 3, - 3pg, Sfc = if k is not divisible by 5. 

s 4 = p 4 - 4p 2 ^ + 2g 2 , 

S5 =p 5 -5p 3 g + 5pg 2 . 5 - Allzero - 
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Page 155 



2. See Ex. 2, p. 136. 



3. e> {/ \c + yg + e 5 "^' {7^c - V'Q, Q = lc : 



1„2 



4. e^' 7/ic + v^ + e 7 "^' \\c - y/Q, Q = \c 2 - q 7 



(j = 0, 1, 2, 3, 4). 
(j = 0, 1,...,6). 



Page 156 



1. c| - 2ciC3 + 2c4. 
3. C1C3 — 4c4. 



2. c\ c 2 - 2c| - cic 3 + 4c 4 . 
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12. (i) s a s 6 s c s d - Ss a s 6 s c+d + 2T,s a s b+c+d + Ss a+& s c+d - 6s a+&+c+d . 
(ii) ^( s a - 6s^2a + 8s a s 3a + 3s\ a - 6s 4a ). 
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where all but the last term in the main diagonal is 1, and all terms above 
the diagonal are zero except those in the last column. If k > n, we must 
take Cj = (j > n). 
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1. y 2 (16 - y 2 ); y = 0, x = ±3; y = ±4, x = +5. 

2. (c - l) 2 (y 2 - 25)(y 2 - 16). If c / 1, y = ±5, x = 0; y = ±4, x = +3. 
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4. 2±3i, -2±i, ±i. 
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2. pgr — p s — r — 0, x + r/p, x + px + ps/r. 

3. 1, 3, 1 ± *. 11. See Ex. 15, p. 134. 
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